subscribe to arXiv mailings

Beamforming Optimization for Continuous Aperture Array (CAPA)-based Communications

Authors: Zhaolin Wang, Chongjun Ouyang, Yuanwei Liu

Abstract: The beamforming optimization in continuous aperture array (CAPA)-based multi-user communications is studied. In contrast to conventional spatially discrete antenna arrays, CAPAs can exploit the full spatial degrees of freedoms (DoFs) by emitting information-bearing electromagnetic (EM) wave through continuous source current distributed across the aperture. Nevertheless, such operation renders the… ▽ More The beamforming optimization in continuous aperture array (CAPA)-based multi-user communications is studied. In contrast to conventional spatially discrete antenna arrays, CAPAs can exploit the full spatial degrees of freedoms (DoFs) by emitting information-bearing electromagnetic (EM) wave through continuous source current distributed across the aperture. Nevertheless, such operation renders the beamforming optimization problem as a non-convex integral-based functional programming problem, which is challenging for conventional discrete optimization methods. A couple of low-complexity approaches are proposed to solve the functional programming problem. 1) Calculus of variations (CoV)-based approach: Closed-form structure of the optimal continuous source patterns are derived based on CoV, inspiring a low-complexity integral-free iterative algorithm for solving the functional programming problem. 2) Correlation-based zero-forcing (Corr-ZF) approach: Closed-form ZF source current patterns that completely eliminate the interuser interference are derived based on the channel correlations. By using these patterns, the original functional programming problem is transformed to a simple power allocation problem, which can be solved using the classical water-filling approach with reduced complexity. Our numerical results validate the effectiveness of the proposed designs and reveal that: i) compared to the state-of-the-art Fourier-based discretization approach, the proposed CoV-based approach not only improves communication performance but also reduces computational complexity by up to hundreds of times for large CAPA apertures and high frequencies, and ii) the proposed Corr-ZF approach achieves asymptotically optimal performance compared to the CoV-based approach. △ Less

Submitted 17 October, 2024; originally announced October 2024.

Comments: 13 pages, 9 figures

arXiv:2410.12803 [pdf, other]

Developing Guidelines for Functionally-Grounded Evaluation of Explainable Artificial Intelligence using Tabular Data

Authors: Mythreyi Velmurugan, Chun Ouyang, Yue Xu, Renuka Sindhgatta, Bemali Wickramanayake, Catarina Moreira

Abstract: Explainable Artificial Intelligence (XAI) techniques are used to provide transparency to complex, opaque predictive models. However, these techniques are often designed for image and text data, and it is unclear how fit-for-purpose they are when applied to tabular data. As XAI techniques are rarely evaluated in settings with tabular data, the applicability of existing evaluation criteria and metho… ▽ More Explainable Artificial Intelligence (XAI) techniques are used to provide transparency to complex, opaque predictive models. However, these techniques are often designed for image and text data, and it is unclear how fit-for-purpose they are when applied to tabular data. As XAI techniques are rarely evaluated in settings with tabular data, the applicability of existing evaluation criteria and methods are also unclear and needs (re-)examination. For example, some works suggest that evaluation methods may unduly influence the evaluation results when using tabular data. This lack of clarity on evaluation procedures can lead to reduced transparency and ineffective use of XAI techniques in real world settings. In this study, we examine literature on XAI evaluation to derive guidelines on functionally-grounded assessment of local, post hoc XAI techniques. We identify 20 evaluation criteria and associated evaluation methods, and derive guidelines on when and how each criterion should be evaluated. We also identify key research gaps to be addressed by future work. Our study contributes to the body of knowledge on XAI evaluation through in-depth examination of functionally-grounded XAI evaluation protocols, and has laid the groundwork for future research on XAI evaluation. △ Less

Submitted 30 September, 2024; originally announced October 2024.

arXiv:2410.11428 [pdf, other]

CTA-Net: A CNN-Transformer Aggregation Network for Improving Multi-Scale Feature Extraction

Authors: Chunlei Meng, Jiacheng Yang, Wei Lin, Bowen Liu, Hongda Zhang, chun ouyang, Zhongxue Gan

Abstract: Convolutional neural networks (CNNs) and vision transformers (ViTs) have become essential in computer vision for local and global feature extraction. However, aggregating these architectures in existing methods often results in inefficiencies. To address this, the CNN-Transformer Aggregation Network (CTA-Net) was developed. CTA-Net combines CNNs and ViTs, with transformers capturing long-range dep… ▽ More Convolutional neural networks (CNNs) and vision transformers (ViTs) have become essential in computer vision for local and global feature extraction. However, aggregating these architectures in existing methods often results in inefficiencies. To address this, the CNN-Transformer Aggregation Network (CTA-Net) was developed. CTA-Net combines CNNs and ViTs, with transformers capturing long-range dependencies and CNNs extracting localized features. This integration enables efficient processing of detailed local and broader contextual information. CTA-Net introduces the Light Weight Multi-Scale Feature Fusion Multi-Head Self-Attention (LMF-MHSA) module for effective multi-scale feature integration with reduced parameters. Additionally, the Reverse Reconstruction CNN-Variants (RRCV) module enhances the embedding of CNNs within the transformer architecture. Extensive experiments on small-scale datasets with fewer than 100,000 samples show that CTA-Net achieves superior performance (TOP-1 Acc 86.76\%), fewer parameters (20.32M), and greater efficiency (FLOPs 2.83B), making it a highly efficient and lightweight solution for visual tasks on small-scale datasets (fewer than 100,000). △ Less

Submitted 15 October, 2024; originally announced October 2024.

Comments: 9 pages, 3 figures

arXiv:2410.04351 [pdf]

An asymmetric surface coating strategy for promotes rapid endothelialization in the rabbit carotid artery

Authors: Lili Tan, Zhiyi Ye, Suhua Yu, Jinxuan Wang, Chenxi Ouyang, Zhengcai Zhang, Robert Guidoin, Guixue Wang

Abstract: Studying surface modification has long been a key area for enhancing the effects of vascular stents after surgery. The study aimed to develop an asymmetric drug-eluting stent (ADES) with differential drug loading on its inner and outer surfaces, hypothesizing that this design would enhance drug delivery efficacy for percutaneous coronary interventions (PCIs) compared to uniformly coated drug-eluti… ▽ More Studying surface modification has long been a key area for enhancing the effects of vascular stents after surgery. The study aimed to develop an asymmetric drug-eluting stent (ADES) with differential drug loading on its inner and outer surfaces, hypothesizing that this design would enhance drug delivery efficacy for percutaneous coronary interventions (PCIs) compared to uniformly coated drug-eluting stents (UDES). An ultrasonic atomization spraying device was utilized to fabricate the ADES, which was subsequently evaluated for drug release patterns, hemocompatibility, and biocompatibility. In vitro, assessments demonstrated favorable hemocompatibility and showed targeted drug delivery capabilities of ADES within artificial blood vessels. Furthermore, in vivo testing using a rabbit carotid artery model revealed significant endothelialization on stented segments treated with the ADES. These findings suggest that the ADES holds promise as a minimally invasive platform for improving cardiovascular disease treatment outcomes by addressing thrombus formation and neointima proliferation more effectively than traditional stents. △ Less

Submitted 6 October, 2024; originally announced October 2024.

Comments: 24 pages,7 figures, 1 table

arXiv:2409.09796 [pdf, other]

Universal Topology Refinement for Medical Image Segmentation with Polynomial Feature Synthesis

Authors: Liu Li, Hanchun Wang, Matthew Baugh, Qiang Ma, Weitong Zhang, Cheng Ouyang, Daniel Rueckert, Bernhard Kainz

Abstract: Although existing medical image segmentation methods provide impressive pixel-wise accuracy, they often neglect topological correctness, making their segmentations unusable for many downstream tasks. One option is to retrain such models whilst including a topology-driven loss component. However, this is computationally expensive and often impractical. A better solution would be to have a versatile… ▽ More Although existing medical image segmentation methods provide impressive pixel-wise accuracy, they often neglect topological correctness, making their segmentations unusable for many downstream tasks. One option is to retrain such models whilst including a topology-driven loss component. However, this is computationally expensive and often impractical. A better solution would be to have a versatile plug-and-play topology refinement method that is compatible with any domain-specific segmentation pipeline. Directly training a post-processing model to mitigate topological errors often fails as such models tend to be biased towards the topological errors of a target segmentation network. The diversity of these errors is confined to the information provided by a labelled training set, which is especially problematic for small datasets. Our method solves this problem by training a model-agnostic topology refinement network with synthetic segmentations that cover a wide variety of topological errors. Inspired by the Stone-Weierstrass theorem, we synthesize topology-perturbation masks with randomly sampled coefficients of orthogonal polynomial bases, which ensures a complete and unbiased representation. Practically, we verified the efficiency and effectiveness of our methods as being compatible with multiple families of polynomial bases, and show evidence that our universal plug-and-play topology refinement network outperforms both existing topology-driven learning-based and post-processing methods. We also show that combining our method with learning-based models provides an effortless add-on, which can further improve the performance of existing approaches. △ Less

Submitted 15 September, 2024; originally announced September 2024.

Comments: Accepted by the 27th International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI 2024)

arXiv:2408.13948 [pdf, ps, other]

Diversity and Multiplexing for Continuous Aperture Array (CAPA)-Based Communications

Authors: Chongjun Ouyang, Zhaolin Wang, Xingqi Zhang, Yuanwei Liu

Abstract: The performance of multiplexing and diversity achieved by continuous aperture arrays (CAPAs) over fading channels is analyzed. Angular-domain fading models are derived for CAPA-based multiple-input single-output (MISO), single-input multiple-output (SIMO), and multiple-input multiple-output (MIMO) channels using the Fourier relationship between the spatial response and its angular-domain counterpa… ▽ More The performance of multiplexing and diversity achieved by continuous aperture arrays (CAPAs) over fading channels is analyzed. Angular-domain fading models are derived for CAPA-based multiple-input single-output (MISO), single-input multiple-output (SIMO), and multiple-input multiple-output (MIMO) channels using the Fourier relationship between the spatial response and its angular-domain counterpart. Building on these models, angular-domain transmission frameworks are proposed to facilitate CAPA-based communications, under which the performance of multiplexing and diversity is analyzed. 1) For SIMO and MISO channels, closed-form expressions are derived for the average data rate (ADR) and outage probability (OP). Additionally, asymptotic analyses are performed in the high signal-to-noise ratio (SNR) regime to unveil the maximal multiplexing gain and maximal diversity gain. The diversity-multiplexing trade-off (DMT) is also characterized, along with the array gain within the DMT framework. 2) For MIMO channels, high-SNR approximations are derived for the ADR and OP, based on which the DMT and associated array gain are revealed. The performance of CAPAs is further compared with that of conventional spatially discrete arrays (SPDAs) to highlight the superiority of CAPAs. The analytical and numerical results demonstrate that: i) compared to SPDAs, CAPAs achieve a lower OP and higher ADR, resulting in better spectral efficiency; ii) CAPAs achieve the same DMT as SPDAs with half-wavelength antenna spacing while attaining a larger array gain; and iii) CAPAs achieve a better DMT than SPDAs with antenna spacing greater than half a wavelength. △ Less

Submitted 25 August, 2024; originally announced August 2024.

Comments: 40 pages

arXiv:2408.10706 [pdf, ps, other]

Performance Analysis of Physical Layer Security: From Far-Field to Near-Field

Authors: Boqun Zhao, Chongjun Ouyang, Xingqi Zhang, Yuanwei Liu

Abstract: The secrecy performance in both near-field and far-field communications is analyzed using two fundamental metrics: the secrecy capacity under a power constraint and the minimum power requirement to achieve a specified secrecy rate target. 1) For the secrecy capacity, a closed-form expression is derived under a discrete-time memoryless setup. This expression is further analyzed under several far-fi… ▽ More The secrecy performance in both near-field and far-field communications is analyzed using two fundamental metrics: the secrecy capacity under a power constraint and the minimum power requirement to achieve a specified secrecy rate target. 1) For the secrecy capacity, a closed-form expression is derived under a discrete-time memoryless setup. This expression is further analyzed under several far-field and near-field channel models, and the capacity scaling law is revealed by assuming an infinitely large transmit array and an infinitely high power. A novel concept of "depth of insecurity" is proposed to evaluate the secrecy performance achieved by near-field beamfocusing. It is demonstrated that increasing the number of transmit antennas reduces this depth and thus improves the secrecy performance. 2) Regarding the minimum required power, a closed-form expression is derived and analyzed within far-field and near-field scenarios. Asymptotic analyses are performed by setting the number of transmit antennas to infinity to unveil the power scaling law. Numerical results are provided to demonstrate that: i) compared to far-field communications, near-field communications expand the areas where secure transmission is feasible, specifically when the eavesdropper is located in the same direction as the intended receiver; ii) as the number of transmit antennas increases, neither the secrecy capacity nor the minimum required power scales or vanishes unboundedly, adhering to the principle of energy conservation. △ Less

Submitted 20 August, 2024; originally announced August 2024.

arXiv:2408.00952 [pdf, ps, other]

A Primer on Near-Field Communications for Next-Generation Multiple Access

Authors: Chongjun Ouyang, Zhaolin Wang, Yan Chen, Xidong Mu, Peiying Zhu

Abstract: Multiple-antenna technologies are advancing toward the development of extremely large aperture arrays and the utilization of extremely high frequencies, driving the progress of next-generation multiple access (NGMA). This evolution is accompanied by the emergence of near-field communications (NFC), characterized by spherical-wave propagation, which introduces additional range dimensions to the cha… ▽ More Multiple-antenna technologies are advancing toward the development of extremely large aperture arrays and the utilization of extremely high frequencies, driving the progress of next-generation multiple access (NGMA). This evolution is accompanied by the emergence of near-field communications (NFC), characterized by spherical-wave propagation, which introduces additional range dimensions to the channel and enhances system throughput. In this context, a tutorial-based primer on NFC is presented, emphasizing its applications in multiuser communications and multiple access (MA). The following areas are investigated: \romannumeral1) the commonly used near-field channel models are reviewed along with their simplifications under various near-field conditions. \romannumeral2) Building upon these models, the information-theoretic capacity limits of NFC-MA are analyzed, including the derivation of sum-rate capacity and capacity region, and their upper limits for both downlink and uplink scenarios. \romannumeral3) A detailed investigation of near-field multiuser beamforming design is presented, offering low-complexity and effective NFC-MA design methodologies in both the spatial and wavenumber (angular) domains. Throughout these investigations, near-field MA is compared with its far-field counterpart to highlight its superiority and flexibility in terms of interference management, thereby laying the groundwork for achieving NGMA. △ Less

Submitted 8 August, 2024; v1 submitted 1 August, 2024; originally announced August 2024.

Comments: 34 pages

arXiv:2407.21055 [pdf, other]

Bailicai: A Domain-Optimized Retrieval-Augmented Generation Framework for Medical Applications

Authors: Cui Long, Yongbin Liu, Chunping Ouyang, Ying Yu

Abstract: Large Language Models (LLMs) have exhibited remarkable proficiency in natural language understanding, prompting extensive exploration of their potential applications across diverse domains. In the medical domain, open-source LLMs have demonstrated moderate efficacy following domain-specific fine-tuning; however, they remain substantially inferior to proprietary models such as GPT-4 and GPT-3.5. Th… ▽ More Large Language Models (LLMs) have exhibited remarkable proficiency in natural language understanding, prompting extensive exploration of their potential applications across diverse domains. In the medical domain, open-source LLMs have demonstrated moderate efficacy following domain-specific fine-tuning; however, they remain substantially inferior to proprietary models such as GPT-4 and GPT-3.5. These open-source models encounter limitations in the comprehensiveness of domain-specific knowledge and exhibit a propensity for 'hallucinations' during text generation. To mitigate these issues, researchers have implemented the Retrieval-Augmented Generation (RAG) approach, which augments LLMs with background information from external knowledge bases while preserving the model's internal parameters. However, document noise can adversely affect performance, and the application of RAG in the medical field remains in its nascent stages. This study presents the Bailicai framework: a novel integration of retrieval-augmented generation with large language models optimized for the medical domain. The Bailicai framework augments the performance of LLMs in medicine through the implementation of four sub-modules. Experimental results demonstrate that the Bailicai approach surpasses existing medical domain LLMs across multiple medical benchmarks and exceeds the performance of GPT-3.5. Furthermore, the Bailicai method effectively attenuates the prevalent issue of hallucinations in medical applications of LLMs and ameliorates the noise-related challenges associated with traditional RAG techniques when processing irrelevant or pseudo-relevant documents. △ Less

Submitted 24 July, 2024; originally announced July 2024.

arXiv:2407.11463 [pdf, other]

Investigating Imperceptibility of Adversarial Attacks on Tabular Data: An Empirical Analysis

Authors: Zhipeng He, Chun Ouyang, Laith Alzubaidi, Alistair Barros, Catarina Moreira

Abstract: Adversarial attacks are a potential threat to machine learning models by causing incorrect predictions through imperceptible perturbations to the input data. While these attacks have been extensively studied in unstructured data like images, applying them to tabular data, poses new challenges. These challenges arise from the inherent heterogeneity and complex feature interdependencies in tabular d… ▽ More Adversarial attacks are a potential threat to machine learning models by causing incorrect predictions through imperceptible perturbations to the input data. While these attacks have been extensively studied in unstructured data like images, applying them to tabular data, poses new challenges. These challenges arise from the inherent heterogeneity and complex feature interdependencies in tabular data, which differ from the image data. To account for this distinction, it is necessary to establish tailored imperceptibility criteria specific to tabular data. However, there is currently a lack of standardised metrics for assessing the imperceptibility of adversarial attacks on tabular data. To address this gap, we propose a set of key properties and corresponding metrics designed to comprehensively characterise imperceptible adversarial attacks on tabular data. These are: proximity to the original input, sparsity of altered features, deviation from the original data distribution, sensitivity in perturbing features with narrow distribution, immutability of certain features that should remain unchanged, feasibility of specific feature values that should not go beyond valid practical ranges, and feature interdependencies capturing complex relationships between data attributes. We evaluate the imperceptibility of five adversarial attacks, including both bounded attacks and unbounded attacks, on tabular data using the proposed imperceptibility metrics. The results reveal a trade-off between the imperceptibility and effectiveness of these attacks. The study also identifies limitations in current attack algorithms, offering insights that can guide future research in the area. The findings gained from this empirical analysis provide valuable direction for enhancing the design of adversarial attack algorithms, thereby advancing adversarial machine learning on tabular data. △ Less

Submitted 4 October, 2024; v1 submitted 16 July, 2024; originally announced July 2024.

Comments: 36 pages

arXiv:2407.11158 [pdf, other]

Physics-embedded Fourier Neural Network for Partial Differential Equations

Authors: Qingsong Xu, Nils Thuerey, Yilei Shi, Jonathan Bamber, Chaojun Ouyang, Xiao Xiang Zhu

Abstract: We consider solving complex spatiotemporal dynamical systems governed by partial differential equations (PDEs) using frequency domain-based discrete learning approaches, such as Fourier neural operators. Despite their widespread use for approximating nonlinear PDEs, the majority of these methods neglect fundamental physical laws and lack interpretability. We address these shortcomings by introduci… ▽ More We consider solving complex spatiotemporal dynamical systems governed by partial differential equations (PDEs) using frequency domain-based discrete learning approaches, such as Fourier neural operators. Despite their widespread use for approximating nonlinear PDEs, the majority of these methods neglect fundamental physical laws and lack interpretability. We address these shortcomings by introducing Physics-embedded Fourier Neural Networks (PeFNN) with flexible and explainable error control. PeFNN is designed to enforce momentum conservation and yields interpretable nonlinear expressions by utilizing unique multi-scale momentum-conserving Fourier (MC-Fourier) layers and an element-wise product operation. The MC-Fourier layer is by design translation- and rotation-invariant in the frequency domain, serving as a plug-and-play module that adheres to the laws of momentum conservation. PeFNN establishes a new state-of-the-art in solving widely employed spatiotemporal PDEs and generalizes well across input resolutions. Further, we demonstrate its outstanding performance for challenging real-world applications such as large-scale flood simulations. △ Less

Submitted 15 July, 2024; originally announced July 2024.

Comments: 29 pages,18 figures

arXiv:2407.08227 [pdf, other]

DALL-M: Context-Aware Clinical Data Augmentation with LLMs

Authors: Chihcheng Hsieh, Catarina Moreira, Isabel Blanco Nobre, Sandra Costa Sousa, Chun Ouyang, Margot Brereton, Joaquim Jorge, Jacinto C. Nascimento

Abstract: X-ray images are vital in medical diagnostics, but their effectiveness is limited without clinical context. Radiologists often find chest X-rays insufficient for diagnosing underlying diseases, necessitating comprehensive clinical features and data integration. We present a novel framework to enhance the clinical context through augmentation techniques with clinical tabular data, thereby improving… ▽ More X-ray images are vital in medical diagnostics, but their effectiveness is limited without clinical context. Radiologists often find chest X-rays insufficient for diagnosing underlying diseases, necessitating comprehensive clinical features and data integration. We present a novel framework to enhance the clinical context through augmentation techniques with clinical tabular data, thereby improving its applicability and reliability in AI medical diagnostics. We introduce a pioneering approach to clinical data augmentation that employs large language models to generate patient contextual synthetic data. This methodology is crucial for training more robust deep learning models in healthcare. It preserves the integrity of real patient data while enriching the dataset with contextually relevant synthetic features, significantly enhancing model performance. Our methodology, termed DALL-M, uses a three-phase feature generation process: (i)clinical context storage, (ii)expert query generation, and (iii)context-aware feature augmentation. DALL-M generates new, clinically relevant features by synthesizing chest X-ray images and reports. Applied to 799 cases using nine features from the MIMIC-IV dataset, it created an augmented set of 91 features. This is the first work to generate contextual values for patients' X-ray reports. Specifically, we provide (i)the capacity of LLMs to generate contextual synthetic values for existing clinical features and (ii)their ability to create entirely new clinically relevant features. Empirical validation with machine learning models showed significant performance improvements. Incorporating augmented features increased the F1 score by 16.5% and Precision and Recall by approximately 25%. DALL-M addresses a critical gap in clinical data augmentation, offering a robust framework for generating contextually enriched datasets. △ Less

Submitted 7 October, 2024; v1 submitted 11 July, 2024; originally announced July 2024.

Comments: we introduce a pioneering approach to clinical data augmentation that employs large language models (LLMs) to generate patient contextual synthetic data. It preserves the integrity of real patient data while enriching the dataset with contextually relevant synthetic features, significantly enhancing model performance

ACM Class: I.5.1; J.3; H.3.3; I.2.7

arXiv:2406.15056 [pdf, ps, other]

Continuous Aperture Array (CAPA)-Based Wireless Communications: Capacity Characterization

Authors: Boqun Zhao, Chongjun Ouyang, Xingqi Zhang, Yuanwei Liu

Abstract: The capacity limits of continuous-aperture array (CAPA)-based wireless communications are characterized. To this end, an analytically tractable transmission framework is established for both uplink and downlink CAPA systems. Based on this framework, closed-form expressions for the single-user channel capacity are derived. The results are further extended to a multiuser case by characterizing the c… ▽ More The capacity limits of continuous-aperture array (CAPA)-based wireless communications are characterized. To this end, an analytically tractable transmission framework is established for both uplink and downlink CAPA systems. Based on this framework, closed-form expressions for the single-user channel capacity are derived. The results are further extended to a multiuser case by characterizing the capacity limits of a two-user channel and proposing the associated capacity-achieving decoding and encoding schemes. 1) For the uplink case, the sum-rate capacity and capacity region, as well as the capacity-achieving detectors, are derived. 2) For the downlink case, the uplink-downlink duality is established by deriving the uplink-to-downlink and downlink-to-uplink transformations under the same power constraint, based on which the optimal power allocation policy and the achieved sum-rate capacity and capacity region are characterized. To gain further insights, several case studies are presented by specializing the derived results into various array structures, including the planar CAPA, linear CAPA, and planar spatially discrete array (SPDA). Numerical results are provided to reveal that: i) the channel capacity achieved by CAPAs converges towards a finite upper bound as the aperture size increases; and ii) CAPAs offer significant capacity gains over the conventional SPDAs. △ Less

Submitted 21 June, 2024; originally announced June 2024.

arXiv:2406.13652 [pdf, other]

Stability and Generalizability in SDE Diffusion Models with Measure-Preserving Dynamics

Authors: Weitong Zhang, Chengqi Zang, Liu Li, Sarah Cechnicka, Cheng Ouyang, Bernhard Kainz

Abstract: Inverse problems describe the process of estimating the causal factors from a set of measurements or data. Mapping of often incomplete or degraded data to parameters is ill-posed, thus data-driven iterative solutions are required, for example when reconstructing clean images from poor signals. Diffusion models have shown promise as potent generative tools for solving inverse problems due to their… ▽ More Inverse problems describe the process of estimating the causal factors from a set of measurements or data. Mapping of often incomplete or degraded data to parameters is ill-posed, thus data-driven iterative solutions are required, for example when reconstructing clean images from poor signals. Diffusion models have shown promise as potent generative tools for solving inverse problems due to their superior reconstruction quality and their compatibility with iterative solvers. However, most existing approaches are limited to linear inverse problems represented as Stochastic Differential Equations (SDEs). This simplification falls short of addressing the challenging nature of real-world problems, leading to amplified cumulative errors and biases. We provide an explanation for this gap through the lens of measure-preserving dynamics of Random Dynamical Systems (RDS) with which we analyse Temporal Distribution Discrepancy and thus introduce a theoretical framework based on RDS for SDE diffusion models. We uncover several strategies that inherently enhance the stability and generalizability of diffusion models for inverse problems and introduce a novel score-based diffusion framework, the \textbf{D}ynamics-aware S\textbf{D}E \textbf{D}iffusion \textbf{G}enerative \textbf{M}odel (D$^3$GM). The \textit{Measure-preserving property} can return the degraded measurement to the original state despite complex degradation with the RDS concept of \textit{stability}. Our extensive experimental results corroborate the effectiveness of D$^3$GM across multiple benchmarks including a prominent application for inverse problems, magnetic resonance imaging. Code and data will be publicly available. △ Less

Submitted 19 June, 2024; originally announced June 2024.

arXiv:2405.16694 [pdf, ps, other]

Aperture Selection for CAP Arrays (CAPAs)

Authors: Chongjun Ouyang, Yuanwei Liu, Xingqi Zhang

Abstract: The concept of aperture selection is proposed for continuous aperture array (CAPA)-based communications. The achieved performance is analyzed in an uplink scenario by considering both line-of-sight (LoS) and non-line-of-sight (NLoS) scenarios. In the LoS scenario, the optimal selection strategy is demonstrated to follow the nearest neighbor criterion, and the resulting signal-to-noise ratio (SNR)… ▽ More The concept of aperture selection is proposed for continuous aperture array (CAPA)-based communications. The achieved performance is analyzed in an uplink scenario by considering both line-of-sight (LoS) and non-line-of-sight (NLoS) scenarios. In the LoS scenario, the optimal selection strategy is demonstrated to follow the nearest neighbor criterion, and the resulting signal-to-noise ratio (SNR) is analyzed. In the NLoS scenario, the achieved outage probability along with the diversity order is revealed. Numerical results are provided to demonstrate that aperture selection effectively maintains satisfactory performance by leveraging selection diversity while simultaneously reducing the implementation complexity of CAPAs. △ Less

Submitted 26 May, 2024; originally announced May 2024.

Comments: 6 pages

arXiv:2405.16690 [pdf, ps, other]

On the Performance of Continuous Aperture Array (CAPA)-Based Wireless Communications

Authors: Chongjun Ouyang, Yuanwei Liu, Xingqi Zhang

Abstract: The performance of continuous aperture array (CAPA)-based wireless communications is analyzed in an uplink scenario. An analytical framework is proposed to characterize uplink CAPA-based transmission using electromagnetic field theories. On this basis, new expressions are derived for the channel capacity in a single-user scenario and the sum-rate capacity in a multiuser scenario, along with the ca… ▽ More The performance of continuous aperture array (CAPA)-based wireless communications is analyzed in an uplink scenario. An analytical framework is proposed to characterize uplink CAPA-based transmission using electromagnetic field theories. On this basis, new expressions are derived for the channel capacity in a single-user scenario and the sum-rate capacity in a multiuser scenario, along with the capacity-achieving decoding schemes. These findings are proved to differ greatly from those established for conventional spatially discrete (SPD) arrays. Numerical results are provided to demonstrate that CAPA offers significant capacity gains compared to the SPD array. △ Less

Submitted 26 May, 2024; originally announced May 2024.

Comments: 6 pages

arXiv:2405.16460 [pdf, other]

Probabilistic Contrastive Learning with Explicit Concentration on the Hypersphere

Authors: Hongwei Bran Li, Cheng Ouyang, Tamaz Amiranashvili, Matthew S. Rosen, Bjoern Menze, Juan Eugenio Iglesias

Abstract: Self-supervised contrastive learning has predominantly adopted deterministic methods, which are not suited for environments characterized by uncertainty and noise. This paper introduces a new perspective on incorporating uncertainty into contrastive learning by embedding representations within a spherical space, inspired by the von Mises-Fisher distribution (vMF). We introduce an unnormalized form… ▽ More Self-supervised contrastive learning has predominantly adopted deterministic methods, which are not suited for environments characterized by uncertainty and noise. This paper introduces a new perspective on incorporating uncertainty into contrastive learning by embedding representations within a spherical space, inspired by the von Mises-Fisher distribution (vMF). We introduce an unnormalized form of vMF and leverage the concentration parameter, kappa, as a direct, interpretable measure to quantify uncertainty explicitly. This approach not only provides a probabilistic interpretation of the embedding space but also offers a method to calibrate model confidence against varying levels of data corruption and characteristics. Our empirical results demonstrate that the estimated concentration parameter correlates strongly with the degree of unforeseen data corruption encountered at test time, enables failure analysis, and enhances existing out-of-distribution detection methods. △ Less

Submitted 26 May, 2024; originally announced May 2024.

Comments: technical report

arXiv:2405.14029 [pdf, ps, other]

Analog Beamforming Enabled Multicasting: Finite-Alphabet Inputs and Statistical CSI

Authors: Yanjun Wu, Zhong Xie, Zhuochen Xie, Chongjun Ouyang, Xuwen Liang

Abstract: The average multicast rate (AMR) is analyzed in a multicast channel utilizing analog beamforming with finite-alphabet inputs, considering statistical channel state information (CSI). New expressions for the AMR are derived for non-cooperative and cooperative multicasting scenarios. Asymptotic analyses are conducted in the high signal-to-noise ratio regime to derive the array gain and diversity ord… ▽ More The average multicast rate (AMR) is analyzed in a multicast channel utilizing analog beamforming with finite-alphabet inputs, considering statistical channel state information (CSI). New expressions for the AMR are derived for non-cooperative and cooperative multicasting scenarios. Asymptotic analyses are conducted in the high signal-to-noise ratio regime to derive the array gain and diversity order. It is proved that the analog beamformer influences the AMR through its array gain, leading to the proposal of efficient beamforming algorithms aimed at maximizing the array gain to enhance the AMR. △ Less

Submitted 22 May, 2024; originally announced May 2024.

Comments: 5 pages

arXiv:2405.10246 [pdf, other]

A Foundation Model for Brain Lesion Segmentation with Mixture of Modality Experts

Authors: Xinru Zhang, Ni Ou, Berke Doga Basaran, Marco Visentin, Mengyun Qiao, Renyang Gu, Cheng Ouyang, Yaou Liu, Paul M. Matthew, Chuyang Ye, Wenjia Bai

Abstract: Brain lesion segmentation plays an essential role in neurological research and diagnosis. As brain lesions can be caused by various pathological alterations, different types of brain lesions tend to manifest with different characteristics on different imaging modalities. Due to this complexity, brain lesion segmentation methods are often developed in a task-specific manner. A specific segmentation… ▽ More Brain lesion segmentation plays an essential role in neurological research and diagnosis. As brain lesions can be caused by various pathological alterations, different types of brain lesions tend to manifest with different characteristics on different imaging modalities. Due to this complexity, brain lesion segmentation methods are often developed in a task-specific manner. A specific segmentation model is developed for a particular lesion type and imaging modality. However, the use of task-specific models requires predetermination of the lesion type and imaging modality, which complicates their deployment in real-world scenarios. In this work, we propose a universal foundation model for 3D brain lesion segmentation, which can automatically segment different types of brain lesions for input data of various imaging modalities. We formulate a novel Mixture of Modality Experts (MoME) framework with multiple expert networks attending to different imaging modalities. A hierarchical gating network combines the expert predictions and fosters expertise collaboration. Furthermore, we introduce a curriculum learning strategy during training to avoid the degeneration of each expert network and preserve their specialization. We evaluated the proposed method on nine brain lesion datasets, encompassing five imaging modalities and eight lesion types. The results show that our model outperforms state-of-the-art universal models and provides promising generalization to unseen datasets. △ Less

Submitted 16 July, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

Comments: The work has been early accepted by MICCAI 2024

arXiv:2405.07281 [pdf, ps, other]

Movable Antennas Aided Multicast MISO Communication Systems

Authors: Zhenqiao Cheng, Nanxi Li, Ruizhe Long, Jianchi Zhu, Chongjun Ouyang, Peng Chen

Abstract: A novel multicast communication system with movable antennas (MAs) is proposed, where the antenna position optimization is exploited to enhance the transmission rate. Specifically, an MA-assisted two-user multicast multiple-input single-input system is considered. The joint optimization of the transmit beamforming vector and transmit MA positions is studied by modeling the motion of the MA element… ▽ More A novel multicast communication system with movable antennas (MAs) is proposed, where the antenna position optimization is exploited to enhance the transmission rate. Specifically, an MA-assisted two-user multicast multiple-input single-input system is considered. The joint optimization of the transmit beamforming vector and transmit MA positions is studied by modeling the motion of the MA elements as discrete movements. A low-complexity greedy search-based algorithm is proposed to tackle this non-convex inter-programming problem. A branch-and-bound (BAB)-based method is proposed to achieve the optimal multicast rate with a reduced time complexity than the brute-force search by assuming the two users suffer similar line-of-sight path losses. Numerical results reveal that the proposed MA systems significantly improve the multicast rate compared to conventional fixed-position antennas (FPAs)-based systems. △ Less

Submitted 12 May, 2024; originally announced May 2024.

Comments: 5 pages

arXiv:2405.05387 [pdf, ps, other]

Channel Capacity of Near-Field Multiuser Communications

Authors: Boqun Zhao, Chongjun Ouyang, Xingqi Zhang, Yuanwei Liu

Abstract: The channel capacity of near-field (NF) communications is characterized by considering three types of multiuser channels: i) multiple access channel (MAC), ii) broadcast channel (BC), and iii) multicast channel (MC). For NF MAC and BC, closed-form expressions are derived for the sum-rate capacity as well as the capacity region under a two-user scenario. These results are further extended to scenar… ▽ More The channel capacity of near-field (NF) communications is characterized by considering three types of multiuser channels: i) multiple access channel (MAC), ii) broadcast channel (BC), and iii) multicast channel (MC). For NF MAC and BC, closed-form expressions are derived for the sum-rate capacity as well as the capacity region under a two-user scenario. These results are further extended to scenarios with an arbitrary number of users. For NF MC, closed-form expressions are derived for the two-user channel capacity and the capacity upper bound with more users. Further insights are gleaned by exploring special cases, including scenarios with infinitely large array apertures, co-directional users, and linear arrays. Theoretical and numerical results are presented and compared with far-field communications to demonstrate that: i) the NF capacity of these three channels converges to finite values rather than growing unboundedly as the number of array elements increases; ii) the capacity of the MAC and BC with co-directional users can be improved by using the additional range dimensions in NF channels to reduce inter-user interference (IUI); and iii) the MC capacity benefits less from the NF effect compared to the MAC and BC, as multicasting is less sensitive to IUI. △ Less

Submitted 8 May, 2024; originally announced May 2024.

arXiv:2404.08343 [pdf, ps, other]

On the Impact of Reactive Region on the Near-Field Channel Gain

Authors: Chongjun Ouyang, Zhaolin Wang, Boqun Zhao, Xingqi Zhang, Yuanwei Liu

Abstract: The near-field channel gain is analyzed by considering both radiating and reactive components of the electromagnetic field. Novel expressions are derived for the channel gains of spatially-discrete (SPD) and continuous-aperture (CAP) arrays, which are more accurate than conventional results that neglect the reactive region. To gain further insights, asymptotic analyses are carried out in the large… ▽ More The near-field channel gain is analyzed by considering both radiating and reactive components of the electromagnetic field. Novel expressions are derived for the channel gains of spatially-discrete (SPD) and continuous-aperture (CAP) arrays, which are more accurate than conventional results that neglect the reactive region. To gain further insights, asymptotic analyses are carried out in the large aperture size, based on which the impact of the reactive region is discussed. It is proved that for both SPD and CAP arrays, the impact of the reactive region on near-field channel gain is negligible, even as the array aperture size approaches infinity. △ Less

Submitted 12 April, 2024; originally announced April 2024.

Comments: 7 figures

arXiv:2404.01082 [pdf, other]

The state-of-the-art in Cardiac MRI Reconstruction: Results of the CMRxRecon Challenge in MICCAI 2023

Authors: Jun Lyu, Chen Qin, Shuo Wang, Fanwen Wang, Yan Li, Zi Wang, Kunyuan Guo, Cheng Ouyang, Michael Tänzer, Meng Liu, Longyu Sun, Mengting Sun, Qin Li, Zhang Shi, Sha Hua, Hao Li, Zhensen Chen, Zhenlin Zhang, Bingyu Xin, Dimitris N. Metaxas, George Yiasemis, Jonas Teuwen, Liping Zhang, Weitian Chen, Yidong Zhao , et al. (25 additional authors not shown)

Abstract: Cardiac MRI, crucial for evaluating heart structure and function, faces limitations like slow imaging and motion artifacts. Undersampling reconstruction, especially data-driven algorithms, has emerged as a promising solution to accelerate scans and enhance imaging performance using highly under-sampled data. Nevertheless, the scarcity of publicly available cardiac k-space datasets and evaluation p… ▽ More Cardiac MRI, crucial for evaluating heart structure and function, faces limitations like slow imaging and motion artifacts. Undersampling reconstruction, especially data-driven algorithms, has emerged as a promising solution to accelerate scans and enhance imaging performance using highly under-sampled data. Nevertheless, the scarcity of publicly available cardiac k-space datasets and evaluation platform hinder the development of data-driven reconstruction algorithms. To address this issue, we organized the Cardiac MRI Reconstruction Challenge (CMRxRecon) in 2023, in collaboration with the 26th International Conference on MICCAI. CMRxRecon presented an extensive k-space dataset comprising cine and mapping raw data, accompanied by detailed annotations of cardiac anatomical structures. With overwhelming participation, the challenge attracted more than 285 teams and over 600 participants. Among them, 22 teams successfully submitted Docker containers for the testing phase, with 7 teams submitted for both cine and mapping tasks. All teams use deep learning based approaches, indicating that deep learning has predominately become a promising solution for the problem. The first-place winner of both tasks utilizes the E2E-VarNet architecture as backbones. In contrast, U-Net is still the most popular backbone for both multi-coil and single-coil reconstructions. This paper provides a comprehensive overview of the challenge design, presents a summary of the submitted results, reviews the employed methods, and offers an in-depth discussion that aims to inspire future advancements in cardiac MRI reconstruction models. The summary emphasizes the effective strategies observed in Cardiac MRI reconstruction, including backbone architecture, loss function, pre-processing techniques, physical modeling, and model complexity, thereby providing valuable insights for further developments in this field. △ Less

Submitted 16 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

Comments: 25 pages, 17 figures

arXiv:2403.12226 [pdf, other]

Large-scale flood modeling and forecasting with FloodCast

Authors: Qingsong Xu, Yilei Shi, Jonathan Bamber, Chaojun Ouyang, Xiao Xiang Zhu

Abstract: Large-scale hydrodynamic models generally rely on fixed-resolution spatial grids and model parameters as well as incurring a high computational cost. This limits their ability to accurately forecast flood crests and issue time-critical hazard warnings. In this work, we build a fast, stable, accurate, resolution-invariant, and geometry-adaptative flood modeling and forecasting framework that can pe… ▽ More Large-scale hydrodynamic models generally rely on fixed-resolution spatial grids and model parameters as well as incurring a high computational cost. This limits their ability to accurately forecast flood crests and issue time-critical hazard warnings. In this work, we build a fast, stable, accurate, resolution-invariant, and geometry-adaptative flood modeling and forecasting framework that can perform at large scales, namely FloodCast. The framework comprises two main modules: multi-satellite observation and hydrodynamic modeling. In the multi-satellite observation module, a real-time unsupervised change detection method and a rainfall processing and analysis tool are proposed to harness the full potential of multi-satellite observations in large-scale flood prediction. In the hydrodynamic modeling module, a geometry-adaptive physics-informed neural solver (GeoPINS) is introduced, benefiting from the absence of a requirement for training data in physics-informed neural networks and featuring a fast, accurate, and resolution-invariant architecture with Fourier neural operators. GeoPINS demonstrates impressive performance on popular PDEs across regular and irregular domains. Building upon GeoPINS, we propose a sequence-to-sequence GeoPINS model to handle long-term temporal series and extensive spatial domains in large-scale flood modeling. Next, we establish a benchmark dataset in the 2022 Pakistan flood to assess various flood prediction methods. Finally, we validate the model in three dimensions - flood inundation range, depth, and transferability of spatiotemporal downscaling. Traditional hydrodynamics and sequence-to-sequence GeoPINS exhibit exceptional agreement during high water levels, while comparative assessments with SAR-based flood depth data show that sequence-to-sequence GeoPINS outperforms traditional hydrodynamics, with smaller prediction errors. △ Less

Submitted 18 March, 2024; originally announced March 2024.

Comments: 40 pages, 16 figures, under review

arXiv:2403.09232 [pdf, other]

Generating Feasible and Plausible Counterfactual Explanations for Outcome Prediction of Business Processes

Authors: Alexander Stevens, Chun Ouyang, Johannes De Smedt, Catarina Moreira

Abstract: In recent years, various machine and deep learning architectures have been successfully introduced to the field of predictive process analytics. Nevertheless, the inherent opacity of these algorithms poses a significant challenge for human decision-makers, hindering their ability to understand the reasoning behind the predictions. This growing concern has sparked the introduction of counterfactual… ▽ More In recent years, various machine and deep learning architectures have been successfully introduced to the field of predictive process analytics. Nevertheless, the inherent opacity of these algorithms poses a significant challenge for human decision-makers, hindering their ability to understand the reasoning behind the predictions. This growing concern has sparked the introduction of counterfactual explanations, designed as human-understandable what if scenarios, to provide clearer insights into the decision-making process behind undesirable predictions. The generation of counterfactual explanations, however, encounters specific challenges when dealing with the sequential nature of the (business) process cases typically used in predictive process analytics. Our paper tackles this challenge by introducing a data-driven approach, REVISEDplus, to generate more feasible and plausible counterfactual explanations. First, we restrict the counterfactual algorithm to generate counterfactuals that lie within a high-density region of the process data, ensuring that the proposed counterfactuals are realistic and feasible within the observed process data distribution. Additionally, we ensure plausibility by learning sequential patterns between the activities in the process cases, utilising Declare language templates. Finally, we evaluate the properties that define the validity of counterfactuals. △ Less

Submitted 14 March, 2024; originally announced March 2024.

Comments: Journal Submission

arXiv:2403.06659 [pdf, other]

Zero-Shot ECG Classification with Multimodal Learning and Test-time Clinical Knowledge Enhancement

Authors: Che Liu, Zhongwei Wan, Cheng Ouyang, Anand Shah, Wenjia Bai, Rossella Arcucci

Abstract: Electrocardiograms (ECGs) are non-invasive diagnostic tools crucial for detecting cardiac arrhythmic diseases in clinical practice. While ECG Self-supervised Learning (eSSL) methods show promise in representation learning from unannotated ECG data, they often overlook the clinical knowledge that can be found in reports. This oversight and the requirement for annotated samples for downstream tasks… ▽ More Electrocardiograms (ECGs) are non-invasive diagnostic tools crucial for detecting cardiac arrhythmic diseases in clinical practice. While ECG Self-supervised Learning (eSSL) methods show promise in representation learning from unannotated ECG data, they often overlook the clinical knowledge that can be found in reports. This oversight and the requirement for annotated samples for downstream tasks limit eSSL's versatility. In this work, we address these issues with the Multimodal ECG Representation Learning (MERL}) framework. Through multimodal learning on ECG records and associated reports, MERL is capable of performing zero-shot ECG classification with text prompts, eliminating the need for training data in downstream tasks. At test time, we propose the Clinical Knowledge Enhanced Prompt Engineering (CKEPE) approach, which uses Large Language Models (LLMs) to exploit external expert-verified clinical knowledge databases, generating more descriptive prompts and reducing hallucinations in LLM-generated content to boost zero-shot classification. Based on MERL, we perform the first benchmark across six public ECG datasets, showing the superior performance of MERL compared against eSSL methods. Notably, MERL achieves an average AUC score of 75.2% in zero-shot classification (without training data), 3.2% higher than linear probed eSSL methods with 10\% annotated training data, averaged across all six datasets. Code and models are available at https://github.com/cheliu-computation/MERL △ Less

Submitted 2 July, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

Comments: Accepted by ICML2024

arXiv:2403.00189 [pdf, ps, other]

The Road to Next-Generation Multiple Access: A 50-Year Tutorial Review

Authors: Yuanwei Liu, Chongjun Ouyang, Zhiguo Ding, Robert Schober

Abstract: The evolution of wireless communications has been significantly influenced by remarkable advancements in multiple access (MA) technologies over the past five decades, shaping the landscape of modern connectivity. Within this context, a comprehensive tutorial review is presented, focusing on representative MA techniques developed over the past 50 years. The following areas are explored: i) The foun… ▽ More The evolution of wireless communications has been significantly influenced by remarkable advancements in multiple access (MA) technologies over the past five decades, shaping the landscape of modern connectivity. Within this context, a comprehensive tutorial review is presented, focusing on representative MA techniques developed over the past 50 years. The following areas are explored: i) The foundational principles and information-theoretic capacity limits of power-domain non-orthogonal multiple access (NOMA) are characterized, along with its extension to multiple-input multiple-output (MIMO)-NOMA. ii) Several MA transmission schemes exploiting the spatial domain are investigated, encompassing both conventional space-division multiple access (SDMA)/MIMO-NOMA systems and near-field MA systems utilizing spherical-wave propagation models. iii) The application of NOMA to integrated sensing and communications (ISAC) systems is studied. This includes an introduction to typical NOMA-based downlink/uplink ISAC frameworks, followed by an evaluation of their performance limits using a mutual information (MI)-based analytical framework. iv) Major issues and research opportunities associated with the integration of MA with other emerging technologies are identified to facilitate MA in next-generation networks, i.e., next-generation multiple access (NGMA). Throughout the paper, promising directions are highlighted to inspire future research endeavors in the realm of MA and NGMA. △ Less

Submitted 6 October, 2024; v1 submitted 29 February, 2024; originally announced March 2024.

Comments: 46 pages; to appear in Proceedings of the IEEE

arXiv:2402.09463 [pdf]

Multi-Center Fetal Brain Tissue Annotation (FeTA) Challenge 2022 Results

Authors: Kelly Payette, Céline Steger, Roxane Licandro, Priscille de Dumast, Hongwei Bran Li, Matthew Barkovich, Liu Li, Maik Dannecker, Chen Chen, Cheng Ouyang, Niccolò McConnell, Alina Miron, Yongmin Li, Alena Uus, Irina Grigorescu, Paula Ramirez Gilliland, Md Mahfuzur Rahman Siddiquee, Daguang Xu, Andriy Myronenko, Haoyu Wang, Ziyan Huang, Jin Ye, Mireia Alenyà, Valentin Comte, Oscar Camara , et al. (42 additional authors not shown)

Abstract: Segmentation is a critical step in analyzing the developing human fetal brain. There have been vast improvements in automatic segmentation methods in the past several years, and the Fetal Brain Tissue Annotation (FeTA) Challenge 2021 helped to establish an excellent standard of fetal brain segmentation. However, FeTA 2021 was a single center study, and the generalizability of algorithms across dif… ▽ More Segmentation is a critical step in analyzing the developing human fetal brain. There have been vast improvements in automatic segmentation methods in the past several years, and the Fetal Brain Tissue Annotation (FeTA) Challenge 2021 helped to establish an excellent standard of fetal brain segmentation. However, FeTA 2021 was a single center study, and the generalizability of algorithms across different imaging centers remains unsolved, limiting real-world clinical applicability. The multi-center FeTA Challenge 2022 focuses on advancing the generalizability of fetal brain segmentation algorithms for magnetic resonance imaging (MRI). In FeTA 2022, the training dataset contained images and corresponding manually annotated multi-class labels from two imaging centers, and the testing data contained images from these two imaging centers as well as two additional unseen centers. The data from different centers varied in many aspects, including scanners used, imaging parameters, and fetal brain super-resolution algorithms applied. 16 teams participated in the challenge, and 17 algorithms were evaluated. Here, a detailed overview and analysis of the challenge results are provided, focusing on the generalizability of the submissions. Both in- and out of domain, the white matter and ventricles were segmented with the highest accuracy, while the most challenging structure remains the cerebral cortex due to anatomical complexity. The FeTA Challenge 2022 was able to successfully evaluate and advance generalizability of multi-class fetal brain tissue segmentation algorithms for MRI and it continues to benchmark new algorithms. The resulting new methods contribute to improving the analysis of brain development in utero. △ Less

Submitted 8 February, 2024; originally announced February 2024.

Comments: Results from FeTA Challenge 2022, held at MICCAI; Manuscript submitted. Supplementary Info (including submission methods descriptions) available here: https://zenodo.org/records/10628648

arXiv:2401.14896 [pdf, other]

Higher-order topology in Fibonacci quasicrystals

Authors: Chaozhi Ouyang, Qinghua He, Dong-Hui Xu, Feng Liu

Abstract: In crystalline systems, higher-order topology, characterized by topological states of codimension greater than one, typically arises from the mismatch between Wannier centers and atomic sites, leading to filling anomalies. However, this phenomenon is less understood in aperiodic systems, such as quasicrystals, where Wannier centers are not well defined. In this study, we examine Fibonacci chains a… ▽ More In crystalline systems, higher-order topology, characterized by topological states of codimension greater than one, typically arises from the mismatch between Wannier centers and atomic sites, leading to filling anomalies. However, this phenomenon is less understood in aperiodic systems, such as quasicrystals, where Wannier centers are not well defined. In this study, we examine Fibonacci chains and squares, a quintessential type of quasicrystal, to investigate their higher-order topological properties. We discover that topological interfacial states, including corner states, can be inherited from their higher-dimensional periodic counterparts, such as the two-dimensional Su-Schrieffer-Heeger model. This finding is validated through numerical simulations of both phononic and photonic Fibonacci quasicrystals by the finite element method, revealing the emergence of topological edge and corner states at interfaces between Fibonacci quasicrystals with differing topologies inherited from their parent systems. Our results not only provide insight into the higher-order topology of quasicrystals but also open avenues for exploring novel topological phases in aperiodic structures. △ Less

Submitted 26 January, 2024; originally announced January 2024.

Comments: 9 pages, 6 figures

arXiv:2401.14219 [pdf, other]

Active Simultaneously Transmitting and Reflecting Surface Assisted NOMA Networks

Authors: Xinwei Yue, Jin Xie, Chongjun Ouyang, Yuanwei Liu, Xia Shen, Zhiguo Ding

Abstract: The novel active simultaneously transmitting and reflecting surface (ASTARS) has recently received a lot of attention due to its capability to conquer the multiplicative fading loss and achieve full-space smart radio environments. This paper introduces the ASTARS to assist non-orthogonal multiple access (NOMA) communications, where the stochastic geometry theory is used to model the spatial positi… ▽ More The novel active simultaneously transmitting and reflecting surface (ASTARS) has recently received a lot of attention due to its capability to conquer the multiplicative fading loss and achieve full-space smart radio environments. This paper introduces the ASTARS to assist non-orthogonal multiple access (NOMA) communications, where the stochastic geometry theory is used to model the spatial positions of pairing users. We design the independent reflection/transmission phase-shift controllers of ASTARS to align the phases of cascaded channels at pairing users. We derive new closed-form and asymptotic expressions of the outage probability and ergodic data rate for ASTARS-NOMA networks in the presence of perfect/imperfect successive interference cancellation (pSIC). The diversity orders and multiplexing gains for ASTARS-NOMA are derived to provide more insights. Furthermore, the system throughputs of ASTARS-NOMA are investigated in both delay-tolerant and delay-limited transmission modes. The numerical results are presented and show that: 1) ASTARS-NOMA with pSIC outperforms ASTARS assisted-orthogonal multiple access (ASTARS-OMA) in terms of outage probability and ergodic data rate; 2) The outage probability of ASTARS-NOMA can be further reduced within a certain range by increasing the power amplification factors; 3) The system throughputs of ASTARS-NOMA are superior to that of ASTARS-OMA in both delay-limited and delay-tolerant transmission modes. △ Less

Submitted 25 January, 2024; originally announced January 2024.

arXiv:2401.14129 [pdf, ps, other]

Performance Analysis of Holographic MIMO Based Integrated Sensing and Communications

Authors: Boqun Zhao, Chongjun Ouyang, Xingqi Zhang, Yuanwei Liu

Abstract: Given the high spectral efficiency, holographic multiple-input multiple-output (MIMO) technology holds promise for enhancing both sensing and communication capabilities. However, accurately characterizing its performance poses a challenge due to the spatial correlation induced by densely spaced antennas. In this paper, a holographic MIMO (HMIMO) based integrated sensing and communications (ISAC) f… ▽ More Given the high spectral efficiency, holographic multiple-input multiple-output (MIMO) technology holds promise for enhancing both sensing and communication capabilities. However, accurately characterizing its performance poses a challenge due to the spatial correlation induced by densely spaced antennas. In this paper, a holographic MIMO (HMIMO) based integrated sensing and communications (ISAC) framework is proposed for both downlink and uplink scenarios. The spacial correlation is incorporated in the communication channel modeling, while an accurate spherical wave-based model is utilized to characterize sensing link. By considering both instantaneous channel state information (CSI) and statistical CSI, closed-form expressions are derived for sensing rates (SRs), communication rates (CRs), and outage probabilities under different ISAC designs to investigate the theoretical performance limits of the proposed HISAC framework. Further insights are gained by examining high signal-to-noise ratio slopes and diversity orders. Specifically, i) for the downlink case, a sensing-centric (S-C) design and a communications-centric (C-C) design are investigated based on different beamforming strategies, and a Pareto optimal design is proposed to characterize the attainable SR-CR region; ii) for the uplink case, the S-C design and the C-C design are distinguished by the interference cancellation order of the communication signal and the sensing signal, and the rate region is obtained through a time-sharing strategy. Numerical results reveal that HMIMO based ISAC (HISAC) systems outperform both conventional MIMO based ISAC systems and HMIMO based frequency-division sensing and communications systems, underscoring the superior performance of HISAC. △ Less

Submitted 8 May, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

arXiv:2401.05900 [pdf, other]

Near-Field Communications: A Comprehensive Survey

Authors: Yuanwei Liu, Chongjun Ouyang, Zhaolin Wang, Jiaqi Xu, Xidong Mu, A. Lee Swindlehurst

Abstract: Multiple-antenna technologies are evolving towards larger aperture sizes, extremely high frequencies, and innovative antenna types. This evolution is fostering the emergence of near-field communications (NFC) in future wireless systems. Considerable attention has been directed towards this cutting-edge technology due to its potential to enhance the capacity of wireless networks by introducing incr… ▽ More Multiple-antenna technologies are evolving towards larger aperture sizes, extremely high frequencies, and innovative antenna types. This evolution is fostering the emergence of near-field communications (NFC) in future wireless systems. Considerable attention has been directed towards this cutting-edge technology due to its potential to enhance the capacity of wireless networks by introducing increased spatial degrees of freedom (DoFs) in the range domain. Within this context, a comprehensive review of the state of the art on NFC is presented, with a specific focus on its 1) fundamental operating principles, 2) channel modeling, 3) performance analysis, 4) signal processing techniques, and 5) integration with other emerging applications. Specifically, 1) the basic principles of NFC are characterized from both physics and communications perspectives, unveiling its unique properties in contrast to far-field communications. 2) Building on these principles, deterministic and stochastic near-field channel models are explored for spatially-discrete (SPD) and continuous-aperture (CAP) arrays. 3) Based on these models, existing contributions to near-field performance analysis are reviewed in terms of DoFs/effective DoFs (EDoFs), the power scaling law, and transmission rate. 4) Existing signal processing techniques for NFC are systematically surveyed, which include channel estimation, beamforming design, and low-complexity beam training. 5) Major issues and research opportunities in incorporating near-field models into other promising technologies are identified to advance NFC's deployment in next-generation networks. Throughout this paper, promising directions are highlighted to inspire future research endeavors in the realm of NFC, underscoring its significance in the advancement of wireless communication technologies. △ Less

Submitted 3 October, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

Comments: 41 pages; to appear in IEEE Communications Surveys and Tutorials

arXiv:2401.05132 [pdf, other]

Dual Quaternion Weighted Directed Graph and Formation Control

Authors: Liqun Qi, Chunfeng Cui, Chen Ouyang

Abstract: We first study the multi-agent formation control problem in a directed graph. The relative configurations are expressed by unit dual quaternions (UDQs). We call such a weighted directed graph a unit dual quaternion weighted directed graph (UDQWDG). We show that a desired relative configuration scheme is reasonable in a {UDQWDG} if and only if for any cycle in this directed graph, the product of re… ▽ More We first study the multi-agent formation control problem in a directed graph. The relative configurations are expressed by unit dual quaternions (UDQs). We call such a weighted directed graph a unit dual quaternion weighted directed graph (UDQWDG). We show that a desired relative configuration scheme is reasonable in a {UDQWDG} if and only if for any cycle in this directed graph, the product of relative configurations of the forward arcs, and inverses of relative configurations of the backward arcs, is equal to $1$. We then show that a desired relative configuration scheme in a {directed connected} graph {is reasonable if and only if} the dual quaternion Laplacian is similar to the unweighted Laplacian of the directed graph. Then for a reasonable desired relative configuration scheme, we build the relationship between the desired formation and the eigenvector corresponding to the zero eigenvalue. A numerical method and a control law {are} presented. We then study dual quaternion weighted directed graphs (DQWDG). Ordinary graphs, gain graphs, signed directed graphs, complex weighted directed graphs and UDQWDGs are special cases of DQWDGs. A general theory of DQWDG is presented. △ Less

Submitted 19 August, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

arXiv:2401.01797 [pdf, other]

Parabolic Anderson model in bounded domains of recurrent metric measure spaces

Authors: Fabrice Baudoin, Li Chen, Che-Hung Huang, Cheng Ouyang, Samy Tindel, Jing Wang

Abstract: A metric measure space equipped with a Dirichlet form is called recurrent if its Hausdorff dimension is less than its walk dimension. In bounded domains of such spaces we study the parabolic Anderson models \[ \partial_{t} u(t,x) = Δu(t,x) + βu(t,x) \, \dot{W}_α(t,x) \] where the noise $W_α$ is white in time and colored in space when $α>0$ while for $α=0$ it is also white in space. Both Dirichlet… ▽ More A metric measure space equipped with a Dirichlet form is called recurrent if its Hausdorff dimension is less than its walk dimension. In bounded domains of such spaces we study the parabolic Anderson models \[ \partial_{t} u(t,x) = Δu(t,x) + βu(t,x) \, \dot{W}_α(t,x) \] where the noise $W_α$ is white in time and colored in space when $α>0$ while for $α=0$ it is also white in space. Both Dirichlet and Neumann boundary conditions are considered. Besides proving existence and uniqueness in the Itô sense we also get precise $L^p$ estimates for the moments and intermittency properties of the solution as a consequence. Our study reveals new exponents which are intrinsically associated to the geometry of the underlying space and the results for instance apply in metric graphs or fractals like the Sierpiński gasket for which we prove scaling invariance properties of the models. △ Less

Submitted 3 January, 2024; originally announced January 2024.

Comments: 50 pages, 3 figures

arXiv:2312.14018 [pdf, ps, other]

Enabling Secure Wireless Communications via Movable Antennas

Authors: Zhenqiao Cheng, Nanxi Li, Jianchi Zhu, Xiaoming She, Chongjun Ouyang, Peng Chen

Abstract: A pioneering secure transmission scheme is proposed, which harnesses movable antennas (MAs) to optimize antenna positions for augmenting the physical layer security. Particularly, an MA-enabled secure wireless system is considered, where a multi-antenna transmitter communicates with a single-antenna receiver in the presence of an eavesdropper. The beamformer and antenna positions at the transmitte… ▽ More A pioneering secure transmission scheme is proposed, which harnesses movable antennas (MAs) to optimize antenna positions for augmenting the physical layer security. Particularly, an MA-enabled secure wireless system is considered, where a multi-antenna transmitter communicates with a single-antenna receiver in the presence of an eavesdropper. The beamformer and antenna positions at the transmitter are jointly optimized under two criteria: power consumption minimization and secrecy rate maximization. For each scenario, a novel suboptimal algorithm was proposed to tackle the resulting nonconvex optimization problem, capitalizing on the approaches of alternating optimization and gradient descent. Numerical results demonstrate that the proposed MA systems significantly improve physical layer security compared to various benchmark schemes relying on conventional fixed-position antennas (FPAs). △ Less

Submitted 21 December, 2023; originally announced December 2023.

Comments: Accepted by IEEE ICASSP 2024

arXiv:2312.01529 [pdf, other]

T3D: Towards 3D Medical Image Understanding through Vision-Language Pre-training

Authors: Che Liu, Cheng Ouyang, Yinda Chen, Cesar César Quilodrán-Casas, Lei Ma, Jie Fu, Yike Guo, Anand Shah, Wenjia Bai, Rossella Arcucci

Abstract: Expert annotation of 3D medical image for downstream analysis is resource-intensive, posing challenges in clinical applications. Visual self-supervised learning (vSSL), though effective for learning visual invariance, neglects the incorporation of domain knowledge from medicine. To incorporate medical knowledge into visual representation learning, vision-language pre-training (VLP) has shown promi… ▽ More Expert annotation of 3D medical image for downstream analysis is resource-intensive, posing challenges in clinical applications. Visual self-supervised learning (vSSL), though effective for learning visual invariance, neglects the incorporation of domain knowledge from medicine. To incorporate medical knowledge into visual representation learning, vision-language pre-training (VLP) has shown promising results in 2D image. However, existing VLP approaches become generally impractical when applied to high-resolution 3D medical images due to GPU hardware constraints and the potential loss of critical details caused by downsampling, which is the intuitive solution to hardware constraints. To address the above limitations, we introduce T3D, the first VLP framework designed for high-resolution 3D medical images. T3D incorporates two text-informed pretext tasks: (\lowerromannumeral{1}) text-informed contrastive learning; (\lowerromannumeral{2}) text-informed image restoration. These tasks focus on learning 3D visual representations from high-resolution 3D medical images and integrating clinical knowledge from radiology reports, without distorting information through forced alignment of downsampled volumes with detailed anatomical text. Trained on a newly curated large-scale dataset of 3D medical images and radiology reports, T3D significantly outperforms current vSSL methods in tasks like organ and tumor segmentation, as well as disease classification. This underlines T3D's potential in representation learning for 3D medical image analysis. All data and code will be available upon acceptance. △ Less

Submitted 5 December, 2023; v1 submitted 3 December, 2023; originally announced December 2023.

arXiv:2312.01522 [pdf, other]

G2D: From Global to Dense Radiography Representation Learning via Vision-Language Pre-training

Authors: Che Liu, Cheng Ouyang, Sibo Cheng, Anand Shah, Wenjia Bai, Rossella Arcucci

Abstract: Recently, medical vision-language pre-training (VLP) has reached substantial progress to learn global visual representation from medical images and their paired radiology reports. However, medical imaging tasks in real world usually require finer granularity in visual features. These tasks include visual localization tasks (e.g., semantic segmentation, object detection) and visual grounding task.… ▽ More Recently, medical vision-language pre-training (VLP) has reached substantial progress to learn global visual representation from medical images and their paired radiology reports. However, medical imaging tasks in real world usually require finer granularity in visual features. These tasks include visual localization tasks (e.g., semantic segmentation, object detection) and visual grounding task. Yet, current medical VLP methods face challenges in learning these fine-grained features, as they primarily focus on brute-force alignment between image patches and individual text tokens for local visual feature learning, which is suboptimal for downstream dense prediction tasks. In this work, we propose a new VLP framework, named \textbf{G}lobal to \textbf{D}ense level representation learning (G2D) that achieves significantly improved granularity and more accurate grounding for the learned features, compared to existing medical VLP approaches. In particular, G2D learns dense and semantically-grounded image representations via a pseudo segmentation task parallel with the global vision-language alignment. Notably, generating pseudo segmentation targets does not incur extra trainable parameters: they are obtained on the fly during VLP with a parameter-free processor. G2D achieves superior performance across 6 medical imaging tasks and 25 diseases, particularly in semantic segmentation, which necessitates fine-grained, semantically-grounded image features. In this task, G2D surpasses peer models even when fine-tuned with just 1\% of the training data, compared to the 100\% used by these models. The code will be released upon acceptance. △ Less

Submitted 17 October, 2024; v1 submitted 3 December, 2023; originally announced December 2023.

Comments: Accepted by NeurIPS2024

arXiv:2311.14295 [pdf, ps, other]

Exploiting Active RIS in NOMA Networks with Hardware Impairments

Authors: Xinwei Yue, Meiqi Song, Chongjun Ouyang, Yuanwei Liu, Tian Li, Tianwei Hou

Abstract: Active reconfigurable intelligent surface (ARIS) is a promising way to compensate for multiplicative fading attenuation by amplifying and reflecting event signals to selected users. This paper investigates the performance of ARIS assisted non-orthogonal multiple access (NOMA) networks over cascaded Nakagami-m fading channels. The effects of hardware impairments (HIS) and reflection coefficients on… ▽ More Active reconfigurable intelligent surface (ARIS) is a promising way to compensate for multiplicative fading attenuation by amplifying and reflecting event signals to selected users. This paper investigates the performance of ARIS assisted non-orthogonal multiple access (NOMA) networks over cascaded Nakagami-m fading channels. The effects of hardware impairments (HIS) and reflection coefficients on ARIS-NOMA networks with imperfect successive interference cancellation (ipSIC) and perfect successive interference cancellation (pSIC) are considered. More specifically, we develop new precise and asymptotic expressions of outage probability and ergodic data rate with ipSIC/pSIC for ARIS-NOMA-HIS networks. According to the approximated analyses, the diversity orders and multiplexing gains for couple of non-orthogonal users are attained in detail. Additionally, the energy efficiency of ARIS-NOMA-HIS networks is surveyed in delay-limited and delay-tolerant transmission schemes. The simulation findings are presented to demonstrate that: i) The outage behaviors and ergodic data rates of ARIS-NOMA-HIS networks precede that of ARIS aided orthogonal multiple access (OMA) and passive reconfigurable intelligent surface (PRIS) aided OMA; ii) As the reflection coefficient of ARIS increases, ARIS-NOMA-HIS networks have the ability to provide the strengthened outage performance; and iii) ARIS-NOMA-HIS networks are more energy efficient than ARIS/PRIS-OMA networks and conventional cooperative schemes. △ Less

Submitted 12 January, 2024; v1 submitted 24 November, 2023; originally announced November 2023.

arXiv:2311.06501 [pdf, ps, other]

Sum-Rate Optimization for RIS-Aided Multiuser Communications with Movable Antenna

Authors: Yunan Sun, Hao Xu, Chongjun Ouyang, Hongwen Yang

Abstract: Reconfigurable intelligent surface (RIS) is known as a promising technology to improve the performance of wireless communication networks, which has been extensively studied. Movable antenna (MA) is a novel technology that fully exploits the antenna position for enhancing the channel capacity. In this paper, we propose a new RIS-aided multiuser communication system with MAs. The sum-rate is maximi… ▽ More Reconfigurable intelligent surface (RIS) is known as a promising technology to improve the performance of wireless communication networks, which has been extensively studied. Movable antenna (MA) is a novel technology that fully exploits the antenna position for enhancing the channel capacity. In this paper, we propose a new RIS-aided multiuser communication system with MAs. The sum-rate is maximized by jointly optimizing the beamforming, the reflection coefficient (RC) values of RIS and the positions of MAs. A fractional programming-based iterative algorithm is proposed to solve the formulated non-convex problem, considering three assumptions for the RIS. Numerical results are presented to verify the effectiveness of the proposed algorithm and the superiority of the proposed MA-based system in terms of sum-rate. △ Less

Submitted 11 November, 2023; originally announced November 2023.

Comments: 5 pages

arXiv:2310.10917 [pdf, ps, other]

doi 10.1109/JSTSP.2024.3386054

Modeling and Analysis of Near-Field ISAC

Authors: Boqun Zhao, Chongjun Ouyang, Yuanwei Liu, Xingqi Zhang, H. Vincent Poor

Abstract: As the technical trends for the next-generation wireless network significantly extend the near-field region, a performance reevaluation of integrated sensing and communications (ISAC) with an appropriate channel model to account for the effects introduced by the near field becomes essential. In this paper, a near-field ISAC framework is proposed for both downlink and uplink scenarios based on an a… ▽ More As the technical trends for the next-generation wireless network significantly extend the near-field region, a performance reevaluation of integrated sensing and communications (ISAC) with an appropriate channel model to account for the effects introduced by the near field becomes essential. In this paper, a near-field ISAC framework is proposed for both downlink and uplink scenarios based on an accurate channel model. A uniform planar array is equipped at a base station, where the impacts of the effective aperture and polarization of antennas are considered. For the downlink case, three distinct designs are studied: a communications-centric (C-C) design, a sensing-centric (S-C) design, and a Pareto optimal design. Regarding the uplink case, the C-C design, the S-C design and a time-sharing strategy are considered. Within each design, sensing rates (SRs) and communication rates (CRs) are derived. To gain further insights, high signal-to-noise ratio slopes and rate scaling laws concerning the number of antennas are examined. The attainable near-field SR-CR regions of ISAC and the baseline frequency-division S&C are also characterized. Numerical results reveal that, as the number of antennas in the array grows, the SRs and CRs under our accurate model converge to finite values, while those under conventional far- and near-field models exhibit unbounded growth, highlighting the importance of precisely modeling the channels for near-field ISAC. △ Less

Submitted 12 April, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

Comments: Accepted by IEEE Journal of Selected Topics in Signal Processing

arXiv:2309.12596 [pdf, ps, other]

Movable Antenna-Empowered AirComp

Authors: Zhenqiao Cheng, Nanxi Li, Jianchi Zhu, Xiaoming She, Chongjun Ouyang, Peng Chen

Abstract: A novel over-the-air computation (AirComp) framework, empowered by the incorporation of movable antennas (MAs), is proposed to significantly enhance computation accuracy. Within this framework, the joint optimization of transmit power control, antenna positioning, and receive combining is investigated. An efficient method is proposed to tackle the problem of computation mean-squared error (MSE) mi… ▽ More A novel over-the-air computation (AirComp) framework, empowered by the incorporation of movable antennas (MAs), is proposed to significantly enhance computation accuracy. Within this framework, the joint optimization of transmit power control, antenna positioning, and receive combining is investigated. An efficient method is proposed to tackle the problem of computation mean-squared error (MSE) minimization, capitalizing on the approach of alternating optimization. Numerical results are provided to substantiate the superior MSE performance of the proposed framework, which establish its clear advantage over benchmark systems employing conventional fixed-position antennas (FPAs). △ Less

Submitted 21 September, 2023; originally announced September 2023.

arXiv:2309.11135 [pdf, ps, other]

Sum-Rate Maximization for Movable Antenna Enabled Multiuser Communications

Authors: Zhenqiao Cheng, Nanxi Li, Jianchi Zhu, Chongjun Ouyang

Abstract: A novel multiuser communication system with movable antennas (MAs) is proposed, where the antenna position optimization is exploited to enhance the downlink sum-rate. The joint optimization of the transmit beamforming vector and transmit MA positions is studied for a multiuser multiple-input single-input system. An efficient algorithm is proposed to tackle the formulated non-convex problem via cap… ▽ More A novel multiuser communication system with movable antennas (MAs) is proposed, where the antenna position optimization is exploited to enhance the downlink sum-rate. The joint optimization of the transmit beamforming vector and transmit MA positions is studied for a multiuser multiple-input single-input system. An efficient algorithm is proposed to tackle the formulated non-convex problem via capitalizing on fractional programming, alternating optimization, and gradient descent methods. To strike a better performance-complexity trade-off, a zero-forcing beamforming-based design is also proposed as an alternative. Numerical investigations are presented to verify the efficiency of the proposed algorithms and their superior performance compared with the benchmark relying on conventional fixed-position antennas (FPAs). △ Less

Submitted 22 September, 2023; v1 submitted 20 September, 2023; originally announced September 2023.

Comments: 11 pages

arXiv:2308.16352 [pdf, ps, other]

Downlink and Uplink NOMA-ISAC with Signal Alignment

Authors: Boqun Zhao, Chongjun Ouyang, Xingqi Zhang, Yuanwei Liu

Abstract: Integrated Sensing and Communications (ISAC) surpasses the conventional frequency-division sensing and communications (FDSAC) in terms of spectrum, energy, and hardware efficiency, with potential for greater enhancement through integration of non-orthogonal multiple access (NOMA). Leveraging these advantages, a multiple-input multiple-output NOMA-ISAC framework is proposed in this paper, in which… ▽ More Integrated Sensing and Communications (ISAC) surpasses the conventional frequency-division sensing and communications (FDSAC) in terms of spectrum, energy, and hardware efficiency, with potential for greater enhancement through integration of non-orthogonal multiple access (NOMA). Leveraging these advantages, a multiple-input multiple-output NOMA-ISAC framework is proposed in this paper, in which the technique of signal alignment is adopted. The performance of the proposed framework for both downlink and uplink is analyzed. 1) The downlink ISAC is investigated under three different precoding designs: a sensing-centric (S-C) design, a communications-centric (C-C) design, and a Pareto optimal design. 2) For the uplink case, two scenarios are investigated: a S-C design and a C-C design, which vary based on the order of interference cancellation between the communication and sensing signals. In each of these scenarios, key performance metrics including sensing rate (SR), communication rate (CR), and outage probability are investigated. For a deeper understanding, the asymptotic performance of the system in the high signal-to-noise ratio (SNR) region is also explored, with a focus on the high-SNR slope and diversity order. Finally, the SR-CR rate regions achieved by ISAC and FDSAC are studied. Numerical results reveal that in both downlink and uplink cases, ISAC outperforms FDSAC in terms of sensing and communications performance and is capable of achieving a broader rate region, clearly showcasing its superiority. △ Less

Submitted 15 July, 2024; v1 submitted 30 August, 2023; originally announced August 2023.

Comments: 16 pages, 7 figures

arXiv:2308.10802 [pdf, ps, other]

Parabolic Anderson model with colored noise on torus

Authors: Le Chen, Cheng Ouyang, William Vickery

Abstract: We construct an intrinsic family of Gaussian noises on $d$-dimensional flat torus $\mathbb{T}^d$. It is the analogue of the colored noise on $\mathbb{R}^d$, and allows us to study stochastic PDEs on torus in the Itô sense in high dimensions. With this noise, we consider the parabolic Anderson model (PAM) with measure-valued initial conditions and establish some basic properties of the solution, in… ▽ More We construct an intrinsic family of Gaussian noises on $d$-dimensional flat torus $\mathbb{T}^d$. It is the analogue of the colored noise on $\mathbb{R}^d$, and allows us to study stochastic PDEs on torus in the Itô sense in high dimensions. With this noise, we consider the parabolic Anderson model (PAM) with measure-valued initial conditions and establish some basic properties of the solution, including a sharp upper and lower bound for the moments and Hölder continuity in space and time. The study of the toy model of $\mathbb{T}^d$ in the present paper is a first step towards our effort in understanding how geometry and topology play an role in the behavior of stochastic PDEs on general (compact) manifolds. △ Less

Submitted 21 August, 2023; originally announced August 2023.

arXiv:2308.01146 [pdf, other]

UCDFormer: Unsupervised Change Detection Using a Transformer-driven Image Translation

Authors: Qingsong Xu, Yilei Shi, Jianhua Guo, Chaojun Ouyang, Xiao Xiang Zhu

Abstract: Change detection (CD) by comparing two bi-temporal images is a crucial task in remote sensing. With the advantages of requiring no cumbersome labeled change information, unsupervised CD has attracted extensive attention in the community. However, existing unsupervised CD approaches rarely consider the seasonal and style differences incurred by the illumination and atmospheric conditions in multi-t… ▽ More Change detection (CD) by comparing two bi-temporal images is a crucial task in remote sensing. With the advantages of requiring no cumbersome labeled change information, unsupervised CD has attracted extensive attention in the community. However, existing unsupervised CD approaches rarely consider the seasonal and style differences incurred by the illumination and atmospheric conditions in multi-temporal images. To this end, we propose a change detection with domain shift setting for remote sensing images. Furthermore, we present a novel unsupervised CD method using a light-weight transformer, called UCDFormer. Specifically, a transformer-driven image translation composed of a light-weight transformer and a domain-specific affinity weight is first proposed to mitigate domain shift between two images with real-time efficiency. After image translation, we can generate the difference map between the translated before-event image and the original after-event image. Then, a novel reliable pixel extraction module is proposed to select significantly changed/unchanged pixel positions by fusing the pseudo change maps of fuzzy c-means clustering and adaptive threshold. Finally, a binary change map is obtained based on these selected pixel pairs and a binary classifier. Experimental results on different unsupervised CD tasks with seasonal and style changes demonstrate the effectiveness of the proposed UCDFormer. For example, compared with several other related methods, UCDFormer improves performance on the Kappa coefficient by more than 12\%. In addition, UCDFormer achieves excellent performance for earthquake-induced landslide detection when considering large-scale applications. The code is available at \url{https://github.com/zhu-xlab/UCDFormer} △ Less

Submitted 2 August, 2023; originally announced August 2023.

Comments: 16 pages, 7 figures, IEEE Transactions on Geoscience and Remote Sensing

arXiv:2308.00362 [pdf, other]

Near-Field Communications: A Degree-of-Freedom Perspective

Authors: Chongjun Ouyang, Yuanwei Liu, Xingqi Zhang, Lajos Hanzo

Abstract: Multiple-antenna technologies are advancing towards large-scale aperture sizes and extremely high frequencies, leading to the emergence of near-field communications (NFC) in future wireless systems. To this context, we investigate the degree of freedom (DoF) in near-field multiple-input multiple-output (MIMO) systems. We consider both spatially discrete (SPD) antennas and continuous aperture (CAP)… ▽ More Multiple-antenna technologies are advancing towards large-scale aperture sizes and extremely high frequencies, leading to the emergence of near-field communications (NFC) in future wireless systems. To this context, we investigate the degree of freedom (DoF) in near-field multiple-input multiple-output (MIMO) systems. We consider both spatially discrete (SPD) antennas and continuous aperture (CAP) antennas. Additionally, we explore three important DoF-related performance metrics and examine their relationships with the classic DoF. Numerical results demonstrate the benefits of NFC over far-field communications (FFC) in terms of providing increased spatial DoFs. We also identify promising research directions for NFC from a DoF perspective. △ Less

Submitted 2 August, 2023; v1 submitted 1 August, 2023; originally announced August 2023.

Comments: 8 pages

MSC Class: 94A05

arXiv:2307.15023 [pdf, ps, other]

Revealing the Impact of Beamforming in ISAC

Authors: Chongjun Ouyang, Yuanwei Liu, Xingqi Zhang

Abstract: This letter proposes advanced beamforming design and analyzes its influence on the sensing and communications (S&C) performance for a multiple-antenna integrated S&C (ISAC) system with a single communication user and a single target. Novel closed-form beamformers are derived for three typical scenarios, including the sensing-centric design, communications-centric design, and Pareto optimal design.… ▽ More This letter proposes advanced beamforming design and analyzes its influence on the sensing and communications (S&C) performance for a multiple-antenna integrated S&C (ISAC) system with a single communication user and a single target. Novel closed-form beamformers are derived for three typical scenarios, including the sensing-centric design, communications-centric design, and Pareto optimal design. Regarding each scenario, the outage probability, ergodic communication rate (CR), and sensing rate (SR) are analyzed to derive the diversity orders and high signal-to-noise ratio slopes. Numerical results are provided to demonstrate that i) beamforming design can affect the high-SNR power offset and diversity order but does not influence the high-SNR slope; ii) ISAC exhibits larger high-SNR slopes and a more extensive SR-CR region than conventional frequency-division S&C (FDSAC) techniques. △ Less

Submitted 27 July, 2023; originally announced July 2023.

Comments: 5 pages

MSC Class: 94A05

arXiv:2307.07957 [pdf, other]

Generalizable and explainable prediction of potential miRNA-disease associations based on heterogeneous graph learning

Authors: Yi Zhou, Meixuan Wu, Chengzhou Ouyang, Min Zhu

Abstract: Biomedical research has revealed the crucial role of miRNAs in the progression of many diseases, and computational prediction methods are increasingly proposed for assisting biological experiments to verify miRNA-disease associations (MDAs). However, the generalizability and explainability are currently underemphasized. It's significant to generalize effective predictions to entities with fewer or… ▽ More Biomedical research has revealed the crucial role of miRNAs in the progression of many diseases, and computational prediction methods are increasingly proposed for assisting biological experiments to verify miRNA-disease associations (MDAs). However, the generalizability and explainability are currently underemphasized. It's significant to generalize effective predictions to entities with fewer or no existing MDAs and reveal how the prediction scores are derived. In this study, our work contributes to data, model, and result analysis. First, for better formulation of the MDA issue, we integrate multi-source data into a heterogeneous graph with a broader learning and prediction scope, and we split massive verified MDAs into independent training, validation, and test sets as a benchmark. Second, we construct an end-to-end data-driven model that performs node feature encoding, graph structure learning, and binary prediction sequentially, with a heterogeneous graph transformer as the central module. Finally, computational experiments illustrate that our method outperforms existing state-of-the-art methods, achieving better evaluation metrics and alleviating the neglect of unknown miRNAs and diseases effectively. Case studies further demonstrate that we can make reliable MDA detections on diseases without MDA records, and the predictions can be explained in general and case by case. △ Less

Submitted 27 August, 2023; v1 submitted 16 July, 2023; originally announced July 2023.

arXiv:2306.10909 [pdf, ps, other]

Uniqueness for a stochastic ideal dyadic MHD model

Authors: Mimi Dai, Qirui Peng, Cheng Ouyang

Abstract: We study a stochastic dyadic model with both forward and backward energy cascade mechanisms for the inviscid and non-resistive magnetohydrodynamics. For a particular class of stochastic forcing, we show weak uniqueness for the stochastic system. However the solution dissipates the energy which is formally an invariant quantity for the system. We study a stochastic dyadic model with both forward and backward energy cascade mechanisms for the inviscid and non-resistive magnetohydrodynamics. For a particular class of stochastic forcing, we show weak uniqueness for the stochastic system. However the solution dissipates the energy which is formally an invariant quantity for the system. △ Less

Submitted 19 June, 2023; originally announced June 2023.

Comments: 26 pages

MSC Class: 35Q35; 76B03; 76W05

arXiv:2305.17751 [pdf, other]

doi 10.1109/OJCOMS.2023.3305583

Near-Field Communications: A Tutorial Review

Authors: Yuanwei Liu, Zhaolin Wang, Jiaqi Xu, Chongjun Ouyang, Xidong Mu, Robert Schober

Abstract: Extremely large-scale antenna arrays, tremendously high frequencies, and new types of antennas are three clear trends in multi-antenna technology for supporting the sixth-generation (6G) networks. To properly account for the new characteristics introduced by these three trends in communication system design, the near-field spherical-wave propagation model needs to be used, which differs from the c… ▽ More Extremely large-scale antenna arrays, tremendously high frequencies, and new types of antennas are three clear trends in multi-antenna technology for supporting the sixth-generation (6G) networks. To properly account for the new characteristics introduced by these three trends in communication system design, the near-field spherical-wave propagation model needs to be used, which differs from the classical far-field planar-wave one. As such, near-field communication (NFC) will become essential in 6G networks. In this tutorial, we cover three key aspects of NFC. 1) Channel Modelling: We commence by reviewing near-field spherical-wave-based channel models for spatially-discrete (SPD) antennas. Then, uniform spherical wave (USW) and non-uniform spherical wave (NUSW) models are discussed. Subsequently, we introduce a general near-field channel model for SPD antennas and a Green's function-based channel model for continuous-aperture (CAP) antennas. 2) Beamfocusing and Antenna Architectures: We highlight the properties of near-field beamfocusing and discuss NFC antenna architectures for both SPD and CAP antennas. Moreover, the basic principles of near-field beam training are introduced. 3) Performance Analysis: Finally, we provide a comprehensive performance analysis framework for NFC. For near-field line-of-sight channels, the received signal-to-noise ratio and power-scaling law are derived. For statistical near-field multipath channels, a general analytical framework is proposed, based on which analytical expressions for the outage probability, ergodic channel capacity, and ergodic mutual information are obtained. Finally, for each aspect, topics for future research are discussed. △ Less

Submitted 5 September, 2024; v1 submitted 28 May, 2023; originally announced May 2023.

Comments: 48 pages, 37 figures

Showing 1–50 of 168 results for author: Ouyang, C