-
BSharedRAG: Backbone Shared Retrieval-Augmented Generation for the E-commerce Domain
Authors:
Kaisi Guan,
Qian Cao,
Yuchong Sun,
Xiting Wang,
Ruihua Song
Abstract:
Retrieval-Augmented Generation (RAG) systems are important in domains such as e-commerce, which have many long-tail entities and frequently updated information. Most existing works adopt separate modules for retrieval and generation, which may be suboptimal since the retrieval task and the generation task cannot benefit from each other to improve performance. We propose a novel Backbone Shared RAG framework (BSharedRAG). It first uses a domain-specific corpus to continually pre-train a base model as a domain-specific backbone model and then trains two plug-and-play Low-Rank Adaptation (LoRA) modules based on the shared backbone to minimize retrieval and generation losses, respectively. Experimental results indicate that our proposed BSharedRAG outperforms baseline models by 5% and 13% in Hit@3 on two datasets in retrieval evaluation and by 23% in terms of BLEU-3 in generation evaluation. Our code, models, and dataset are available at https://bsharedrag.github.io.
Submitted 30 September, 2024;
originally announced September 2024.
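For readers unfamiliar with the Hit@3 metric quoted in the abstract above, the following is a minimal sketch of how Hit@k is commonly computed for retrieval. It is not the authors' evaluation code; the function names (`hit_at_k`, `mean_hit_at_k`) and the toy document ids are hypothetical and chosen only for illustration.

```python
def hit_at_k(ranked_ids, relevant_ids, k=3):
    """Return 1.0 if any of the top-k retrieved ids is relevant, else 0.0."""
    relevant = set(relevant_ids)
    return 1.0 if any(doc_id in relevant for doc_id in ranked_ids[:k]) else 0.0


def mean_hit_at_k(runs, k=3):
    """Average Hit@k over a list of (ranked_ids, relevant_ids) pairs."""
    if not runs:
        raise ValueError("runs must contain at least one query")
    return sum(hit_at_k(ranked, gold, k) for ranked, gold in runs) / len(runs)


# Toy data (hypothetical): two queries with their ranked results and gold ids.
runs = [
    (["d7", "d2", "d9", "d1"], ["d2"]),  # relevant document at rank 2: a hit
    (["d4", "d5", "d6", "d3"], ["d3"]),  # relevant document at rank 4: a miss for k=3
]
score = mean_hit_at_k(runs, k=3)
print(score)
```

With this toy data, one of the two queries has a relevant document in its top three results, so the averaged score is one half.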
-
Fire-Flyer AI-HPC: A Cost-Effective Software-Hardware Co-Design for Deep Learning
Authors:
Wei An,
Xiao Bi,
Guanting Chen,
Shanhuang Chen,
Chengqi Deng,
Honghui Ding,
Kai Dong,
Qiushi Du,
Wenjun Gao,
Kang Guan,
Jianzhong Guo,
Yongqiang Guo,
Zhe Fu,
Ying He,
Panpan Huang,
Jiashi Li,
Wenfeng Liang,
Xiaodong Liu,
Xin Liu,
Yiyuan Liu,
Yuxuan Liu,
Shanghao Lu,
Xuan Lu,
Xiaotao Nie,
Tian Pei
, et al. (27 additional authors not shown)
Abstract:
The rapid progress in Deep Learning (DL) and Large Language Models (LLMs) has exponentially increased demands for computational power and bandwidth. This, combined with the high costs of faster computing chips and interconnects, has significantly inflated High Performance Computing (HPC) construction costs. To address these challenges, we introduce the Fire-Flyer AI-HPC architecture, a synergistic hardware-software co-design framework and its best practices. For DL training, we deployed the Fire-Flyer 2 with 10,000 PCIe A100 GPUs, achieving performance approximating the DGX-A100 while reducing costs by half and energy consumption by 40%. We specifically engineered HFReduce to accelerate allreduce communication and implemented numerous measures to keep our Computation-Storage Integrated Network congestion-free. Through our software stack, including HaiScale, 3FS, and HAI-Platform, we achieved substantial scalability by overlapping computation and communication. Our system-oriented experience from DL training provides valuable insights to drive future advancements in AI-HPC.
Submitted 31 August, 2024; v1 submitted 26 August, 2024;
originally announced August 2024.
-
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
Authors:
DeepSeek-AI,
Qihao Zhu,
Daya Guo,
Zhihong Shao,
Dejian Yang,
Peiyi Wang,
Runxin Xu,
Y. Wu,
Yukun Li,
Huazuo Gao,
Shirong Ma,
Wangding Zeng,
Xiao Bi,
Zihui Gu,
Hanwei Xu,
Damai Dai,
Kai Dong,
Liyue Zhang,
Yishi Piao,
Zhibin Gou,
Zhenda Xie,
Zhewen Hao,
Bingxuan Wang,
Junxiao Song,
Deli Chen
, et al. (15 additional authors not shown)
Abstract:
We present DeepSeek-Coder-V2, an open-source Mixture-of-Experts (MoE) code language model that achieves performance comparable to GPT4-Turbo in code-specific tasks. Specifically, DeepSeek-Coder-V2 is further pre-trained from an intermediate checkpoint of DeepSeek-V2 with additional 6 trillion tokens. Through this continued pre-training, DeepSeek-Coder-V2 substantially enhances the coding and mathematical reasoning capabilities of DeepSeek-V2, while maintaining comparable performance in general language tasks. Compared to DeepSeek-Coder-33B, DeepSeek-Coder-V2 demonstrates significant advancements in various aspects of code-related tasks, as well as reasoning and general capabilities. Additionally, DeepSeek-Coder-V2 expands its support for programming languages from 86 to 338, while extending the context length from 16K to 128K. In standard benchmark evaluations, DeepSeek-Coder-V2 achieves superior performance compared to closed-source models such as GPT4-Turbo, Claude 3 Opus, and Gemini 1.5 Pro in coding and math benchmarks.
Submitted 17 June, 2024;
originally announced June 2024.
-
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
Authors:
DeepSeek-AI,
Aixin Liu,
Bei Feng,
Bin Wang,
Bingxuan Wang,
Bo Liu,
Chenggang Zhao,
Chengqi Deng,
Chong Ruan,
Damai Dai,
Daya Guo,
Dejian Yang,
Deli Chen,
Dongjie Ji,
Erhang Li,
Fangyun Lin,
Fuli Luo,
Guangbo Hao,
Guanting Chen,
Guowei Li,
H. Zhang,
Hanwei Xu,
Hao Yang,
Haowei Zhang,
Honghui Ding
, et al. (132 additional authors not shown)
Abstract:
We present DeepSeek-V2, a strong Mixture-of-Experts (MoE) language model characterized by economical training and efficient inference. It comprises 236B total parameters, of which 21B are activated for each token, and supports a context length of 128K tokens. DeepSeek-V2 adopts innovative architectures including Multi-head Latent Attention (MLA) and DeepSeekMoE. MLA guarantees efficient inference through significantly compressing the Key-Value (KV) cache into a latent vector, while DeepSeekMoE enables training strong models at an economical cost through sparse computation. Compared with DeepSeek 67B, DeepSeek-V2 achieves significantly stronger performance, and meanwhile saves 42.5% of training costs, reduces the KV cache by 93.3%, and boosts the maximum generation throughput to 5.76 times. We pretrain DeepSeek-V2 on a high-quality and multi-source corpus consisting of 8.1T tokens, and further perform Supervised Fine-Tuning (SFT) and Reinforcement Learning (RL) to fully unlock its potential. Evaluation results show that, even with only 21B activated parameters, DeepSeek-V2 and its chat versions still achieve top-tier performance among open-source models.
Submitted 19 June, 2024; v1 submitted 7 May, 2024;
originally announced May 2024.
-
Physical Property Understanding from Language-Embedded Feature Fields
Authors:
Albert J. Zhai,
Yuan Shen,
Emily Y. Chen,
Gloria X. Wang,
Xinlei Wang,
Sheng Wang,
Kaiyu Guan,
Shenlong Wang
Abstract:
Can computers perceive the physical properties of objects solely through vision? Research in cognitive science and vision science has shown that humans excel at identifying materials and estimating their physical properties based purely on visual appearance. In this paper, we present a novel approach for dense prediction of the physical properties of objects using a collection of images. Inspired by how humans reason about physics through vision, we leverage large language models to propose candidate materials for each object. We then construct a language-embedded point cloud and estimate the physical properties of each 3D point using a zero-shot kernel regression approach. Our method is accurate, annotation-free, and applicable to any object in the open world. Experiments demonstrate the effectiveness of the proposed approach in various physical property reasoning tasks, such as estimating the mass of common objects, as well as other properties like friction and hardness.
Submitted 5 April, 2024;
originally announced April 2024.
-
XRL-Bench: A Benchmark for Evaluating and Comparing Explainable Reinforcement Learning Techniques
Authors:
Yu Xiong,
Zhipeng Hu,
Ye Huang,
Runze Wu,
Kai Guan,
Xingchen Fang,
Ji Jiang,
Tianze Zhou,
Yujing Hu,
Haoyu Liu,
Tangjie Lyu,
Changjie Fan
Abstract:
Reinforcement Learning (RL) has demonstrated substantial potential across diverse fields, yet understanding its decision-making process, especially in real-world scenarios where rationality and safety are paramount, is an ongoing challenge. This paper delves into Explainable RL (XRL), a subfield of Explainable AI (XAI) aimed at unravelling the complexities of RL models. Our focus rests on state-explaining techniques, a crucial subset within XRL methods, as they reveal the underlying factors influencing an agent's actions at any given time. Despite their significant role, the lack of a unified evaluation framework hinders assessment of their accuracy and effectiveness. To address this, we introduce XRL-Bench, a unified standardized benchmark tailored for the evaluation and comparison of XRL methods, encompassing three main modules: standard RL environments, explainers based on state importance, and standard evaluators. XRL-Bench supports both tabular and image data for state explanation. We also propose TabularSHAP, an innovative and competitive XRL method. We demonstrate the practical utility of TabularSHAP in real-world online gaming services and offer an open-source benchmark platform for the straightforward implementation and evaluation of XRL methods. Our contributions facilitate the continued progression of XRL technology.
Submitted 19 February, 2024;
originally announced February 2024.
-
Structure from Duplicates: Neural Inverse Graphics from a Pile of Objects
Authors:
Tianhang Cheng,
Wei-Chiu Ma,
Kaiyu Guan,
Antonio Torralba,
Shenlong Wang
Abstract:
Our world is full of identical objects (e.g., cans of Coke, cars of the same model). These duplicates, when seen together, provide additional and strong cues for us to effectively reason about 3D. Inspired by this observation, we introduce Structure from Duplicates (SfD), a novel inverse graphics framework that reconstructs geometry, material, and illumination from a single image containing multiple identical objects. SfD begins by identifying multiple instances of an object within an image, and then jointly estimates the 6DoF pose for all instances. An inverse graphics pipeline is subsequently employed to jointly reason about the shape, material of the object, and the environment light, while adhering to the shared geometry and material constraint across instances. Our primary contributions involve utilizing object duplicates as a robust prior for single-image inverse graphics and proposing an in-plane rotation-robust Structure from Motion (SfM) formulation for joint 6-DoF object pose estimation. By leveraging multi-view cues from a single image, SfD generates more realistic and detailed 3D reconstructions, significantly outperforming existing single image reconstruction models and multi-view reconstruction approaches with a similar or greater number of observations.
Submitted 10 January, 2024;
originally announced January 2024.
-
DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
Authors:
DeepSeek-AI,
Xiao Bi,
Deli Chen,
Guanting Chen,
Shanhuang Chen,
Damai Dai,
Chengqi Deng,
Honghui Ding,
Kai Dong,
Qiushi Du,
Zhe Fu,
Huazuo Gao,
Kaige Gao,
Wenjun Gao,
Ruiqi Ge,
Kang Guan,
Daya Guo,
Jianzhong Guo,
Guangbo Hao,
Zhewen Hao,
Ying He,
Wenjie Hu,
Panpan Huang,
Erhang Li
, et al. (63 additional authors not shown)
Abstract:
The rapid development of open-source large language models (LLMs) has been truly remarkable. However, the scaling law described in previous literature presents varying conclusions, which casts a dark cloud over scaling LLMs. We delve into the study of scaling laws and present our distinctive findings that facilitate scaling of large scale models in two commonly used open-source configurations, 7B and 67B. Guided by the scaling laws, we introduce DeepSeek LLM, a project dedicated to advancing open-source language models with a long-term perspective. To support the pre-training phase, we have developed a dataset that currently consists of 2 trillion tokens and is continuously expanding. We further conduct supervised fine-tuning (SFT) and Direct Preference Optimization (DPO) on DeepSeek LLM Base models, resulting in the creation of DeepSeek Chat models. Our evaluation results demonstrate that DeepSeek LLM 67B surpasses LLaMA-2 70B on various benchmarks, particularly in the domains of code, mathematics, and reasoning. Furthermore, open-ended evaluations reveal that DeepSeek LLM 67B Chat exhibits superior performance compared to GPT-3.5.
Submitted 5 January, 2024;
originally announced January 2024.
-
From Past to Future: Digital Methods Towards Artefact Analysis
Authors:
Andrew Harris,
Andrea Cremaschi,
Tse Siang Lim,
Maria De Iorio,
Kwa Chong Guan
Abstract:
Over the past two decades, Digital Humanities has transformed the landscape of humanities and social sciences, enabling advanced computational analysis and interpretation of extensive datasets. Notably, recent initiatives in Southeast Asia, particularly in Singapore, focus on categorising and archiving historical data such as artwork, literature and, most notably, archaeological artefacts. This study illustrates the profound potential of Digital Humanities through the application of statistical methods on two distinct artefact datasets. Specifically, we present the results of an automated die study on mid-1st millennium AD "Rising Sun" coinage from mainland Southeast Asia, while subsequently utilising unsupervised statistical methods on 2D images of 13th-14th century earthenware ceramics excavated from the precolonial St. Andrew's Cathedral site in central Singapore. This research offers a comparative assessment showcasing the transformative impact of statistics-based approaches on the interpretation and analysis of diverse archaeological materials and within Digital Humanities overall.
Submitted 21 December, 2023;
originally announced December 2023.
-
Networked Time Series Imputation via Position-aware Graph Enhanced Variational Autoencoders
Authors:
Dingsu Wang,
Yuchen Yan,
Ruizhong Qiu,
Yada Zhu,
Kaiyu Guan,
Andrew J Margenot,
Hanghang Tong
Abstract:
Multivariate time series (MTS) imputation has been a widely studied problem in recent years. Existing methods can be divided into two main groups: (1) deep recurrent or generative models that primarily focus on time series features, and (2) graph neural network (GNN) based models that utilize the topological information from the inherent graph structure of MTS as relational inductive bias for imputation. Nevertheless, these methods either neglect topological information or assume the graph structure is fixed and accurately known. Thus, they fail to fully utilize the graph dynamics for precise imputation in more challenging MTS data such as networked time series (NTS), where the underlying graph is constantly changing and might have missing edges. In this paper, we propose a novel approach to overcome these limitations. First, we define the problem of imputation over NTS, which contains missing values in both node time series features and graph structures. Then, we design a new model named PoGeVon, which leverages a variational autoencoder (VAE) to predict missing values over both node time series features and graph structures. In particular, we propose a new node position embedding based on random walk with restart (RWR) in the encoder, with provably higher expressive power compared with message-passing based GNNs. We further design a decoder with 3-stage predictions from the perspective of multi-task learning to impute missing values in both time series and graph structures reciprocally. Experimental results demonstrate the effectiveness of our model over baselines.
Submitted 26 June, 2023; v1 submitted 29 May, 2023;
originally announced May 2023.
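The abstract above mentions a node position embedding based on random walk with restart (RWR). As background only, here is a minimal sketch of generic RWR scoring by power iteration on a small dense adjacency matrix. It is not the PoGeVon embedding itself; the function name, the restart probability of 0.15, and the three-node path graph are all assumptions made for illustration.

```python
def random_walk_with_restart(adj, seed, restart=0.15, iters=100):
    """Approximate RWR scores from `seed` by power iteration.

    adj     : square list-of-lists of non-negative edge weights
    seed    : index of the node the walker restarts at
    restart : probability of jumping back to the seed at each step (assumed value)
    """
    n = len(adj)
    # Row-normalise the adjacency matrix into transition probabilities.
    trans = []
    for row in adj:
        total = sum(row)
        if total == 0:
            raise ValueError("every node needs at least one outgoing edge")
        trans.append([w / total for w in row])

    scores = [0.0] * n
    scores[seed] = 1.0
    for _ in range(iters):
        nxt = [0.0] * n
        for i in range(n):
            for j in range(n):
                # With probability (1 - restart) the walker follows an edge i -> j.
                nxt[j] += (1.0 - restart) * scores[i] * trans[i][j]
        # With probability `restart` the walker jumps back to the seed.
        nxt[seed] += restart
        scores = nxt
    return scores


# Hypothetical path graph 0 - 1 - 2, with the walker restarting at node 0.
path_graph = [
    [0, 1, 0],
    [1, 0, 1],
    [0, 1, 0],
]
rwr_scores = random_walk_with_restart(path_graph, seed=0)
print(rwr_scores)
```

The resulting scores sum to one and give the seed node more mass than the node at the far end of the path, which is the sense in which RWR scores encode a node's position relative to the seed.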
-
6G Enabled Advanced Transportation Systems
Authors:
Ruiqi Liu,
Meng Hua,
Ke Guan,
Xiping Wang,
Leyi Zhang,
Tianqi Mao,
Di Zhang,
Qingqing Wu,
Abbas Jamalipour
Abstract:
With the emergence of communication services with stringent requirements such as autonomous driving or in-flight Internet, the sixth-generation (6G) wireless network is envisaged to become an enabling technology for future transportation systems. In this paper, two ways of interaction between 6G networks and transportation are extensively investigated. On one hand, the new usage scenarios and capabilities of 6G over existing cellular networks are first highlighted. Then, its potential for seamless and ubiquitous connectivity across heterogeneous space-air-ground transportation systems is demonstrated, where railways, airplanes, high-altitude platforms and satellites are investigated. On the other hand, we reveal that the introduction of 6G guarantees a more intelligent, efficient and secure transportation system. Specifically, technical analysis of how 6G can empower future transportation is provided, based on the latest research and standardization progress in localization, integrated sensing and communications, and security. The technical challenges and insights for the road ahead are also summarized as possible inspiration for 6G enabled advanced transportation.
Submitted 11 December, 2023; v1 submitted 24 May, 2023;
originally announced May 2023.
-
A 3D Modeling Method for Scattering on Rough Surfaces at the Terahertz Band
Authors:
Ben Chen,
Ke Guan,
Danping He,
Pengxiang Xie,
Zhangdui Zhong,
Jianwu Dou,
Shahid Mumtaz,
Wael Bazzi
Abstract:
The terahertz (THz) band (0.1-10 THz) is widely considered to be a candidate band for the sixth-generation mobile communication technology (6G). However, due to its short wavelength (less than 1 mm), scattering becomes a particularly significant propagation mechanism. In previous studies, we proposed a scattering model to characterize the scattering in THz bands, which can only reconstruct the scattering in the incidence plane. In this paper, a three-dimensional (3D) stochastic model is proposed to characterize the THz scattering on rough surfaces. Then, we reconstruct the scattering on rough surfaces with different shapes and under different incidence angles utilizing the proposed model. Good agreements can be achieved between the proposed model and full-wave simulation results. This stochastic 3D scattering model can be integrated into the standard channel modeling framework to realize more realistic THz channel data for the evaluation of 6G.
Submitted 5 May, 2023;
originally announced May 2023.
-
Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition
Authors:
Stephanie Milani,
Anssi Kanervisto,
Karolis Ramanauskas,
Sander Schulhoff,
Brandon Houghton,
Sharada Mohanty,
Byron Galbraith,
Ke Chen,
Yan Song,
Tianze Zhou,
Bingquan Yu,
He Liu,
Kai Guan,
Yujing Hu,
Tangjie Lv,
Federico Malato,
Florian Leopold,
Amogh Raut,
Ville Hautamäki,
Andrew Melnik,
Shu Ishida,
João F. Henriques,
Robert Klassert,
Walter Laurito,
Ellen Novoseller
, et al. (5 additional authors not shown)
Abstract:
To facilitate research in the direction of fine-tuning foundation models from human feedback, we held the MineRL BASALT Competition on Fine-Tuning from Human Feedback at NeurIPS 2022. The BASALT challenge asks teams to compete to develop algorithms to solve tasks with hard-to-specify reward functions in Minecraft. Through this competition, we aimed to promote the development of algorithms that use human feedback as channels to learn the desired behavior. We describe the competition and provide an overview of the top solutions. We conclude by discussing the impact of the competition and future directions for improvement.
Submitted 23 March, 2023;
originally announced March 2023.
-
A Ray-tracing and Deep Learning Fusion Super-resolution Modeling Method for Wireless Mobile Channel
Authors:
Zhao Zhang,
Danping He,
Xiping Wang,
Ke Guan,
Zhangdui Zhong,
Jianwu Dou
Abstract:
Mobile channel modeling has always been a core part of the design, deployment and optimization of communication systems, especially in the 5G-and-beyond era. Deterministic channel modeling can describe the mobile channel precisely, but it is costly in equipment and time. In this paper, we propose a novel super resolution (SR) model for cluster characteristics prediction. The model is based on deep neural networks with residual connections. A series of simulations at 3.5 GHz are conducted by a three-dimensional ray tracing (RT) simulator in diverse scenarios. Cluster characteristics are extracted and corresponding data sets are constructed to train the model. Experiments demonstrate that the proposed SR approach achieves better power and cluster location prediction performance than the traditional interpolation method, with relative reductions in root mean square error (RMSE) of 51% and 78%. Channel impulse response (CIR) is reconstructed based on cluster characteristics and matches well with the multi-path component (MPC). The proposed method can be used to efficiently and accurately generate large volumes of mobile channel data, significantly reducing the computation time compared with RT alone.
Submitted 27 January, 2023;
originally announced January 2023.
-
Super-resolution of Ray-tracing Channel Simulation via Attention Mechanism based Deep Learning Model
Authors:
Haoyang Zhang,
Danping He,
Xiping Wang,
Wenbin Wang,
Yunhao Cheng,
Ke Guan
Abstract:
As an emerging approach, deep learning plays an increasingly influential role in channel modeling. Traditional ray tracing (RT) methods of channel modeling tend to be inefficient and expensive. In this paper, we present a super-resolution (SR) model for channel characteristics. Residual connections and an attention mechanism are applied to this convolutional neural network (CNN) model. Experiments show that the proposed model can reduce the noise interference generated in the SR process and address the low efficiency of RT. The mean absolute error of our channel SR model on the PL is 2.82 dB with scale factor 2; in theory, the same accuracy as RT took only 52% of the time. Compared with vision transformer (ViT), the proposed model also demonstrates less running time and computing cost in SR of channel characteristics.
Submitted 21 January, 2023; v1 submitted 11 January, 2023;
originally announced January 2023.
-
Retrieval Based Time Series Forecasting
Authors:
Baoyu Jing,
Si Zhang,
Yada Zhu,
Bin Peng,
Kaiyu Guan,
Andrew Margenot,
Hanghang Tong
Abstract:
Time series data appears in a variety of applications such as smart transportation and environmental monitoring. One of the fundamental problems for time series analysis is time series forecasting. Despite the success of recent deep time series forecasting methods, they require sufficient observation of historical values to forecast accurately. In other words, the ratio of the output length (or forecasting horizon) to the sum of the input and output lengths should be low enough (e.g., 0.3). As the ratio increases (e.g., to 0.8), the uncertainty for the forecasting accuracy increases significantly. In this paper, we show both theoretically and empirically that the uncertainty could be effectively reduced by retrieving relevant time series as references. In the theoretical analysis, we first quantify the uncertainty and show its connections to the Mean Squared Error (MSE). Then we prove that models with references are easier to learn than models without references since the retrieved references could reduce the uncertainty. To empirically demonstrate the effectiveness of the retrieval based time series forecasting models, we introduce a simple yet effective two-stage method, called ReTime, consisting of a relational retrieval and a content synthesis. We also show that ReTime can be easily adapted to the spatial-temporal time series and time series imputation settings. Finally, we evaluate ReTime on real-world datasets to demonstrate its effectiveness.
Submitted 27 September, 2022;
originally announced September 2022.
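The ratio described in the abstract above (output length divided by the sum of input and output lengths) can be made concrete with a small calculation. This is only an illustrative sketch; the function name `horizon_ratio` and the example window lengths are hypothetical and do not come from the paper.

```python
def horizon_ratio(input_len, output_len):
    """Ratio of the forecasting horizon to the total (input + output) length."""
    if input_len < 0 or output_len <= 0:
        raise ValueError("input_len must be >= 0 and output_len must be > 0")
    return output_len / (input_len + output_len)


# Hypothetical windows: 70 observed steps predicting 30, then 20 predicting 80.
easy_ratio = horizon_ratio(input_len=70, output_len=30)
hard_ratio = horizon_ratio(input_len=20, output_len=80)
print(easy_ratio, hard_ratio)
```

The first window corresponds to the low-ratio regime (0.3) mentioned in the abstract, and the second to the high-ratio regime (0.8), where little history is available relative to the horizon.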
-
A Multi-Task Learning Model for Super Resolution of Wireless Channel Characteristics
Authors:
Xiping Wang,
Zhao Zhang,
Danping He,
Ke Guan,
Dongliang Liu,
Jianwu Dou
Abstract:
Channel modeling has always been a core part of communication system design and development, especially in the 5G and 6G era. Traditional approaches like stochastic channel modeling and ray-tracing (RT) based channel modeling depend heavily on measurement data or simulation, which are usually expensive and time consuming. In this paper, we propose a novel super resolution (SR) model for generating channel characteristics data. The model is based on multi-task learning (MTL) convolutional neural networks (CNN) with residual connections. Experiments demonstrate that the proposed SR model achieves excellent performance in mean absolute error and standard deviation of error. Advantages of the proposed model are demonstrated in comparisons with other state-of-the-art deep learning models. An ablation study also confirms the necessity of multi-task learning and of the techniques used in model design. The contribution of this paper could be helpful in channel modeling, network optimization, positioning and other work related to wireless channel characteristics by largely reducing the workload of simulation or measurement.
Submitted 9 September, 2022;
originally announced September 2022.
-
Adaptive Transfer Learning for Plant Phenotyping
Authors:
Jun Wu,
Elizabeth A. Ainsworth,
Sheng Wang,
Kaiyu Guan,
Jingrui He
Abstract:
Plant phenotyping (Guo et al. 2021; Pieruschka et al. 2019) focuses on studying the diverse traits of plants related to their growth. More specifically, accurately measuring a plant's anatomical, ontogenetical, physiological and biochemical properties makes it possible to identify the crucial factors of plant growth in different environments. One commonly used approach is to predict the plant's traits using hyperspectral reflectance (Yendrek et al. 2017; Wang et al. 2021). However, the data distributions of the hyperspectral reflectance data in plant phenotyping might vary across environments and plants. Consequently, it would be computationally expensive to learn machine learning models separately for one plant in different environments. To solve this problem, we focus on studying the knowledge transferability of modern machine learning models in plant phenotyping. More specifically, this work aims to answer the following questions. (1) How is the performance of conventional machine learning models, e.g., partial least squares regression (PLSR), Gaussian process regression (GPR) and multi-layer perceptron (MLP), affected by the number of annotated samples for plant phenotyping? (2) Can neural-network-based transfer learning models improve the performance of plant phenotyping? (3) Can neural-network-based transfer learning be improved by using infinite-width hidden layers for plant phenotyping?
Submitted 13 January, 2022;
originally announced January 2022.
-
Reconfigurable Intelligent Surface Enabled Spatial Multiplexing with Fully Convolutional Network
Authors:
Bile Peng,
Jan-Aike Termöhlen,
Cong Sun,
Danping He,
Ke Guan,
Tim Fingscheidt,
Eduard A. Jorswieck
Abstract:
Reconfigurable intelligent surface (RIS) is an emerging technology for future wireless communication systems. In this work, we consider downlink spatial multiplexing enabled by the RIS for weighted sum-rate (WSR) maximization. In the literature, most solutions use alternating gradient-based optimization, which has moderate performance, high complexity, and limited scalability. We propose to solve this problem with a fully convolutional network (FCN), which was originally designed for semantic segmentation of images. The rectangular shape of the RIS and the spatial correlation of channels with adjacent RIS antennas, due to the short distance between them, encourage us to apply the FCN to the RIS configuration. We design a set of channel features that includes both cascaded channels via the RIS and the direct channel. In the base station (BS), the differentiable minimum mean squared error (MMSE) precoder is used for pretraining, and the weighted minimum mean squared error (WMMSE) precoder, which is nondifferentiable and more complex but achieves better performance, is then applied for fine-tuning. Evaluation results show that the proposed solution has higher performance and allows for a faster evaluation than the baselines. Hence it scales better to a large number of antennas, advancing the RIS one step closer to practical deployment.
Submitted 21 September, 2022; v1 submitted 8 January, 2022;
originally announced January 2022.
-
Graph-based Ensemble Machine Learning for Student Performance Prediction
Authors:
Yinkai Wang,
Aowei Ding,
Kaiyi Guan,
Shixi Wu,
Yuanqi Du
Abstract:
Student performance prediction is a critical research problem for understanding students' needs, presenting proper learning opportunities and resources, and improving teaching quality. However, traditional machine learning methods fail to produce stable and accurate prediction results. In this paper, we propose a graph-based ensemble machine learning method that aims to improve the stability of single machine learning methods via the consensus of multiple methods. Specifically, we leverage both supervised prediction methods and unsupervised clustering methods to build an iterative approach that propagates in a bipartite graph and converges to more stable and accurate prediction results. Extensive experiments demonstrate the effectiveness of our proposed method in predicting student performance more accurately. In particular, our model outperforms the best traditional machine learning algorithms by up to 14.8% in prediction accuracy.
Submitted 21 December, 2021; v1 submitted 15 December, 2021;
originally announced December 2021.
-
STRIVE: Scene Text Replacement In Videos
Authors:
Vijay Kumar B G,
Jeyasri Subramanian,
Varnith Chordia,
Eugene Bart,
Shaobo Fang,
Kelly Guan,
Raja Bala
Abstract:
We propose replacing scene text in videos using deep style transfer and learned photometric transformations. Building on recent progress on still image text replacement, we present extensions that alter text while preserving the appearance and motion characteristics of the original video. Compared to the problem of still image text replacement, our method addresses additional challenges introduced by video, namely effects induced by changing lighting, motion blur, diverse variations in camera-object pose over time, and preservation of temporal consistency. We parse the problem into three steps. First, the text in all frames is normalized to a frontal pose using a spatio-temporal transformer network. Second, the text is replaced in a single reference frame using a state-of-the-art still-image text replacement method. Finally, the new text is transferred from the reference to the remaining frames using a novel learned image transformation network that captures lighting and blur effects in a temporally consistent manner. Results on synthetic and challenging real videos show realistic text transfer, competitive quantitative and qualitative performance, and superior inference speed relative to alternatives. We introduce new synthetic and real-world datasets with paired text objects. To the best of our knowledge, this is the first attempt at deep video text replacement.
Submitted 6 September, 2021;
originally announced September 2021.
-
Cluster-based Characterization and Modeling for UAV Air-to-Ground Time-Varying Channels
Authors:
Zhuangzhuang Cui,
Ke Guan,
Claude Oestges,
César Briso-Rodríguez,
Bo Ai,
Zhangdui Zhong
Abstract:
With the deep integration between the unmanned aerial vehicle (UAV) and wireless communication, UAV-based air-to-ground (AG) propagation channels need more detailed descriptions and accurate models. In this paper, we aim to perform cluster-based characterization and modeling for AG channels. To the best of our knowledge, this is the first study that concentrates on the clustering and tracking of multipath components (MPCs) for time-varying AG channels. Based on measurement data at 6.5 GHz with 500 MHz of bandwidth, we first estimate potential MPCs using the space-alternating generalized expectation-maximization (SAGE) algorithm. Then, we cluster the extracted MPCs considering their static and dynamic characteristics by employing the K-Power-Means (KPM) algorithm under the multipath component distance (MCD) measure. To characterize time-variant clusters, we exploit a clustering-based tracking (CBT) method, which efficiently quantifies the survival lengths of clusters. Ultimately, we establish a cluster-based channel model, and validations illustrate the accuracy of the proposed model. This work not only promotes a better understanding of AG propagation channels but also provides a general cluster-based AG channel model with certain extensibility.
Submitted 26 August, 2021;
originally announced August 2021.
-
Coverage Analysis of Cellular-Connected UAV Communications with 3GPP Antenna and Channel Models
Authors:
Zhuangzhuang Cui,
Ke Guan,
İsmail Güvenç,
Claude Oestges,
Zhangdui Zhong
Abstract:
For reliable and efficient communications of aerial platforms, such as unmanned aerial vehicles (UAVs), the cellular network is envisioned to provide connectivity for the aerial and ground user equipment (GUE) simultaneously, which brings challenges to the existing pattern of the base station (BS) tailored for ground-level services. Thus, we focus on the coverage probability analysis to investigate the coexistence of aerial and terrestrial users, by employing realistic antenna and channel models reported in the 3rd Generation Partnership Project (3GPP). The homogeneous Poisson point process (PPP) is used to describe the BS distribution, and the BS antenna is adjustable in the down-tilted angle and the number of the antenna array. Meanwhile, omnidirectional antennas are used for cellular users. We first derive the approximation of coverage probability and then conduct numerous simulations to evaluate the impacts of antenna numbers, down-tilted angles, carrier frequencies, and user heights. One of the essential findings indicates that the coverage probabilities of high-altitude users become less sensitive to the down-tilted angle. Moreover, we find that the aerial user equipment (AUE) in a certain range of heights can achieve the same or better coverage probability than that of GUE, which provides an insight into the effective deployment of cellular-connected aerial communications.
Submitted 28 May, 2021;
originally announced May 2021.
-
A Serverless Cloud-Fog Platform for DNN-Based Video Analytics with Incremental Learning
Authors:
Huaizheng Zhang,
Meng Shen,
Yizheng Huang,
Yonggang Wen,
Yong Luo,
Guanyu Gao,
Kyle Guan
Abstract:
DNN-based video analytics have empowered many new applications (e.g., automated retail). Meanwhile, the proliferation of fog devices provides developers with more design options to improve performance and save cost. To the best of our knowledge, this paper presents the first serverless system that takes full advantage of the client-fog-cloud synergy to better serve the DNN-based video analytics. Specifically, the system aims to achieve two goals: 1) Provide the optimal analytics results under the constraints of lower bandwidth usage and shorter round-trip time (RTT) by judiciously managing the computational and bandwidth resources deployed in the client, fog, and cloud environment. 2) Free developers from tedious administration and operation tasks, including DNN deployment, cloud and fog's resource management. To this end, we implement a holistic cloud-fog system referred to as VPaaS (Video-Platform-as-a-Service). VPaaS adopts serverless computing to enable developers to build a video analytics pipeline by simply programming a set of functions (e.g., model inference), which are then orchestrated to process videos through carefully designed modules. To save bandwidth and reduce RTT, VPaaS provides a new video streaming protocol that only sends low-quality video to the cloud. The state-of-the-art (SOTA) DNNs deployed at the cloud can identify regions of video frames that need further processing at the fog ends. At the fog ends, misidentified labels in these regions can be corrected using a light-weight DNN model. To address the data drift issues, we incorporate limited human feedback into the system to verify the results and adopt incremental learning to improve our system continuously. The evaluation demonstrates that VPaaS is superior to several SOTA systems: it maintains high accuracy while reducing bandwidth usage by up to 21%, RTT by up to 62.5%, and cloud monetary cost by up to 50%.
Submitted 5 February, 2021;
originally announced February 2021.
-
Channel Modeling for UAV Communications: State of the Art, Case Studies, and Future Directions
Authors:
Zhuangzhuang Cui,
Ke Guan,
César Briso-Rodríguez,
Bo Ai,
Zhangdui Zhong,
Claude Oestges
Abstract:
As essential aerial platforms, unmanned aerial vehicles (UAVs) play an increasingly important role in broad wireless connectivity and high-data-rate transmission for future communication systems. Notably, various communication scenarios are involved in UAV communications, such as intercommunications between UAVs and communications with the ground user equipment, the cellular base station, and the ground station, to name a few. However, existing works mostly focus on a single communication scenario, a designated channel type, and a specific operating frequency, so a comprehensive understanding of multi-scenario, multi-frequency, and multi-type UAV channels is urgently required. This article focuses on the essentials of the corresponding air-to-air (A2A) and air-to-ground (A2G) channels in UAV communications. We first identify the latest key challenges of channel modeling for UAV communications. We then provide the state of the art for A2A and A2G channel properties and models based on extensive measurement campaigns. In particular, we conduct realistic case studies to further demonstrate critical channel characterizations and machine learning-based modeling methods. Last but not least, potential directions are widely discussed for paving the way towards more accurate and effective channel models for UAV communications.
Submitted 16 April, 2021; v1 submitted 11 December, 2020;
originally announced December 2020.
-
Coverage Probability Analysis of IRS-Aided Communication Systems
Authors:
Zhuangzhuang Cui,
Ke Guan,
Jiayi Zhang,
Zhangdui Zhong
Abstract:
The intelligent reflective surface (IRS) technology has received much interest in recent years, thanks to its potential uses in future wireless communications, in which one of the promising use cases is to widen coverage, especially in line-of-sight-blocked scenarios. Therefore, it is critical to analyze the corresponding coverage probability of IRS-aided communication systems. To the best of our knowledge, however, previous works focusing on this issue are very limited. In this paper, we analyze the coverage probability under the Rayleigh fading channel, taking the number and size of the array elements into consideration. We first derive the exact closed form of the coverage probability for the unit element. Afterward, with the method of moment matching, the approximation of the coverage probability can be formulated as the ratio of the upper incomplete Gamma function and the Gamma function, allowing an arbitrary number of elements. Finally, we comprehensively evaluate the impacts of essential factors on the coverage probability, such as the coefficient of the fading channel, the number and size of the elements, and the angle of incidence. Overall, the paper provides a succinct and general expression of coverage probability, which can be helpful in the performance evaluation and practical implementation of the IRS.
Submitted 5 December, 2020;
originally announced December 2020.
-
Performance and Optimization of Reconfigurable Intelligent Surface Aided THz Communications
Authors:
Hongyang Du,
Jiayi Zhang,
Ke Guan,
Dusit Niyato,
Huiying Jiao,
Zhiqin Wang,
Thomas Kürner
Abstract:
TeraHertz (THz) communications can satisfy the high data rate demand with massive bandwidth. However, severe path attenuation and hardware imperfection greatly degrade its performance. Therefore, we utilize the reconfigurable intelligent surface (RIS) technology and investigate RIS-aided THz communications. We first prove that the small-scale amplitude fading of THz signals can be accurately modeled by the fluctuating two-ray distribution, based on two THz signal measurement experiments conducted in a variety of different scenarios. To optimize the phase shifts at the RIS elements, we propose a novel swarm intelligence-based method that does not require full channel estimation. We then derive exact statistical characterizations of the end-to-end signal-to-noise plus distortion ratio (SNDR) and signal-to-noise ratio (SNR). Moreover, we present an asymptotic analysis to obtain more insights when the SNDR or the number of RIS elements is high. Finally, we derive analytical expressions for the outage probability and ergodic capacity. The tight upper bounds of ergodic capacity for both ideal and nonideal radio frequency chains are obtained. It is interesting to find that increasing the number of RIS elements can significantly improve the THz communications system performance. For example, the ergodic capacity can increase by up to 25% when the number of elements increases from 40 to 80, which incurs only insignificant costs to the system.
Submitted 20 March, 2022; v1 submitted 1 December, 2020;
originally announced December 2020.
-
InferBench: Understanding Deep Learning Inference Serving with an Automatic Benchmarking System
Authors:
Huaizheng Zhang,
Yizheng Huang,
Yonggang Wen,
Jianxiong Yin,
Kyle Guan
Abstract:
Deep learning (DL) models have become core modules for many applications. However, deploying these models without careful performance benchmarking that considers the impact of both hardware and software often leads to poor service and costly operational expenditure. To facilitate the deployment of DL models, we implement an automatic and comprehensive benchmark system for DL developers. To accomplish benchmark-related tasks, the developers only need to prepare a configuration file consisting of a few lines of code. Our system, deployed to a leader server in DL clusters, will dispatch users' benchmark jobs to follower workers. Next, the corresponding requests, workload, and even models can be generated automatically by the system to conduct DL serving benchmarks. Finally, developers can leverage many analysis tools and models in our system to gain insights into the trade-offs of different system configurations. In addition, a two-tier scheduler is incorporated to avoid unnecessary interference and improve average job compilation time by up to 1.43x (equivalent to a 30% reduction). Our system design follows the best practice in DL cluster operations to expedite day-to-day DL service evaluation efforts by the developers. We conduct many benchmark experiments to provide in-depth and comprehensive evaluations. We believe these results are of great value as guidelines for DL service configuration and resource allocation.
Submitted 5 January, 2021; v1 submitted 4 November, 2020;
originally announced November 2020.
-
Scoring the Terabit/s Goal: Broadband Connectivity in 6G
Authors:
Nandana Rajatheva,
Italo Atzeni,
Simon Bicais,
Emil Bjornson,
Andre Bourdoux,
Stefano Buzzi,
Carmen D'Andrea,
Jean-Baptiste Dore,
Serhat Erkucuk,
Manuel Fuentes,
Ke Guan,
Yuzhou Hu,
Xiaojing Huang,
Jari Hulkkonen,
Josep Miquel Jornet,
Marcos Katz,
Behrooz Makki,
Rickard Nilsson,
Erdal Panayirci,
Khaled Rabie,
Nuwanthika Rajapaksha,
MohammadJavad Salehi,
Hadi Sarieddeen,
Shahriar Shahabuddin,
Tommy Svensson
, et al. (4 additional authors not shown)
Abstract:
This paper explores the road to vastly improving the broadband connectivity in future 6G wireless systems. Different categories of use cases are considered, with peak data rates up to 1 Tbps. Several categories of enablers at the infrastructure, spectrum, and protocol/algorithmic levels are required to realize the intended broadband connectivity goals in 6G. At the infrastructure level, we consider ultra-massive MIMO technology (possibly implemented using holographic radio), intelligent reflecting surfaces, user-centric cell-free networking, integrated access and backhaul, and integrated space and terrestrial networks. At the spectrum level, the network must seamlessly utilize sub-6 GHz bands for coverage and spatial multiplexing of many devices, while higher bands will be mainly used for pushing the peak rates of point-to-point links. Finally, at the protocol/algorithmic level, the enablers include improved coding, modulation, and waveforms to achieve lower latency, higher reliability, and reduced complexity.
Submitted 21 February, 2021; v1 submitted 17 August, 2020;
originally announced August 2020.
-
Satellite-Terrestrial Channel Characterization in High-Speed Railway Environment at 22.6 GHz
Authors:
Lei Ma,
Ke Guan,
Dong Yan,
Danping He,
Nuno R. Leonor,
Bo Ai,
Junhyeong Kim
Abstract:
The integration of satellite and terrestrial communication systems plays a vital role in the fifth-generation mobile communication system (5G) for ubiquitous coverage, reliable service and flexible networking. Moreover, millimeter wave (mmWave) communication with large bandwidth is a key enabler for 5G intelligent rail transportation. In this paper, the satellite-terrestrial channel at 22.6 GHz is characterized for a typical high-speed railway (HSR) environment. The three-dimensional model of the railway scenario is reconstructed and imported into the Cloud Ray-Tracing (CloudRT) simulation platform. Based on extensive ray-tracing simulations, the channels for the terrestrial HSR system and the satellite-terrestrial system under two weather conditions are characterized, and the interference between them is evaluated. The results of this paper can help in the design and evaluation of the satellite-terrestrial communication system enabling future intelligent rail transportation.
Submitted 10 June, 2020;
originally announced June 2020.
-
MLModelCI: An Automatic Cloud Platform for Efficient MLaaS
Authors:
Huaizheng Zhang,
Yuanming Li,
Yizheng Huang,
Yonggang Wen,
Jianxiong Yin,
Kyle Guan
Abstract:
MLModelCI provides multimedia researchers and developers with a one-stop platform for efficient machine learning (ML) services. The system leverages DevOps techniques to optimize, test, and manage models. It also containerizes and deploys these optimized and validated models as cloud services (MLaaS). In essence, MLModelCI serves as a housekeeper to help users publish models. The models are first automatically converted to optimized formats for production purposes and then profiled under different settings (e.g., batch size and hardware). The profiling information can be used as a guideline for balancing the trade-off between performance and cost of MLaaS. Finally, the system dockerizes the models for ease of deployment to cloud environments. A key feature of MLModelCI is the implementation of a controller, which allows elastic evaluation that utilizes only idle workers while maintaining online service quality. Our system bridges the gap between current ML training and serving systems and thus frees developers from the manual and tedious work often associated with service deployment. We release the platform as an open-source project on GitHub under the Apache 2.0 license, with the aim that it will facilitate and streamline more large-scale ML applications and research projects.
Submitted 9 June, 2020;
originally announced June 2020.
-
White Paper on Broadband Connectivity in 6G
Authors:
Nandana Rajatheva,
Italo Atzeni,
Emil Bjornson,
Andre Bourdoux,
Stefano Buzzi,
Jean-Baptiste Dore,
Serhat Erkucuk,
Manuel Fuentes,
Ke Guan,
Yuzhou Hu,
Xiaojing Huang,
Jari Hulkkonen,
Josep Miquel Jornet,
Marcos Katz,
Rickard Nilsson,
Erdal Panayirci,
Khaled Rabie,
Nuwanthika Rajapaksha,
MohammadJavad Salehi,
Hadi Sarieddeen,
Tommy Svensson,
Oskari Tervo,
Antti Tolli,
Qingqing Wu,
Wen Xu
Abstract:
This white paper explores the road to implementing broadband connectivity in future 6G wireless systems. Different categories of use cases are considered, from extreme capacity with peak data rates up to 1 Tbps, to raising the typical data rates by orders of magnitude, to supporting broadband connectivity at railway speeds up to 1000 km/h. To achieve these goals, not only will the terrestrial networks evolve, but they will also be integrated with satellite networks, all facilitating autonomous systems and various interconnected structures. We believe that several categories of enablers at the infrastructure, spectrum, and protocol/algorithmic levels are required to realize the intended broadband connectivity goals in 6G. At the infrastructure level, we consider ultra-massive MIMO technology (possibly implemented using holographic radio), intelligent reflecting surfaces, user-centric and scalable cell-free networking, integrated access and backhaul, and integrated space and terrestrial networks. At the spectrum level, the network must seamlessly utilize sub-6 GHz bands for coverage and spatial multiplexing of many devices, while higher bands will be used for pushing the peak rates of point-to-point links. The latter path will lead to THz communications complemented by visible light communications in specific scenarios. At the protocol/algorithmic level, the enablers include improved coding, modulation, and waveforms to achieve lower latencies, higher reliability, and reduced complexity. Different options will be needed to optimally support different use cases. The resource efficiency can be further improved by using various combinations of full-duplex radios, interference management based on rate-splitting, machine-learning-based optimization, coded caching, and broadcasting.
Submitted 29 April, 2020;
originally announced April 2020.
-
DeepMask: an algorithm for cloud and cloud shadow detection in optical satellite remote sensing images using deep residual network
Authors:
Ke Xu,
Kaiyu Guan,
Jian Peng,
Yunan Luo,
Sibo Wang
Abstract:
Detecting and masking cloud and cloud shadow from satellite remote sensing images is a pervasive problem in the remote sensing community. Accurate and efficient detection of cloud and cloud shadow is an essential step to harness the value of remotely sensed data for almost all downstream analysis. DeepMask, a new algorithm for cloud and cloud shadow detection in optical satellite remote sensing imagery, is proposed in this study. DeepMask utilizes ResNet, a deep convolutional neural network, for pixel-level cloud mask generation. The algorithm is trained and evaluated on the Landsat 8 Cloud Cover Assessment Validation Dataset distributed across 8 different land types. Compared with CFMask, the most widely used cloud detection algorithm, land-type-specific DeepMask models achieve higher accuracy across all land types. The average accuracy is 93.56%, compared with 85.36% from CFMask. DeepMask also achieves 91.02% accuracy on all-land-type dataset. Compared with other CNN-based cloud mask algorithms, DeepMask benefits from the parsimonious architecture and the residual connection of ResNet. It is compatible with input of any size and shape. DeepMask still maintains high performance when using only red, green, blue, and NIR bands, indicating its potential to be applied to other satellite platforms that only have limited optical bands.
Submitted 8 November, 2019;
originally announced November 2019.
-
Analytical Modeling of UAV-to-Vehicle Propagation Channels in Built-Up Areas
Authors:
Zhuangzhuang Cui,
Ke Guan,
César Briso,
Danping He,
Jianqiao Cheng,
Zhangdui Zhong,
François Quitin
Abstract:
This letter presents an analytical path loss model for air-ground (AG) propagation between unmanned aerial vehicles (UAVs) and ground-based vehicles. We consider built-up areas, such as the ones defined by ITU-R. The three-dimensional (3D) path loss model is based on propagation conditions, and its essential parameters are derived using geometric methods. Owing to its generality, the analytical model can handle arbitrary deployments of buildings, such as suburban, urban, and dense urban. The analytical model is evaluated numerically, and validations conducted by ray-tracing simulations show the high accuracy of the proposed model. The closed-form analytical formulas provide a useful tool for quick and accurate prediction of UAV-to-vehicle propagation channels.
Submitted 26 June, 2019;
originally announced July 2019.
-
Probabilistic Two-Ray Model for Air-to-Air Channel in Built-Up Areas
Authors:
Zhuangzhuang Cui,
Ke Guan,
César Briso,
Danping He,
Bo Ai,
Zhangdui Zhong
Abstract:
In this paper, we present a probabilistic two-ray (PTR) path loss model for the air-to-air (AA) propagation channel in built-up areas. Based on the statistical model of city deployment, the PTR path loss model can be applied to suburban, urban, dense urban, and high-rise urban environments. The path loss is optimally fitted as the Weibull distribution and its fluctuation is fitted as the Normal distribution in ray-tracing simulations. The good agreement between our model and ray tracing indicates that the proposed model can provide a useful tool for accurate and quick prediction for aerial platforms. As an extension of the PTR model, we extract the shadowing factor from numerous simulations and propose an altitude-dependent shadowing model. The results show that the proposed shadowing model is highly consistent with the measurement-based model, which indicates that our research performs well in terms of extensibility and generality.
Submitted 26 June, 2019;
originally announced June 2019.
-
Reinforcement Learning Experience Reuse with Policy Residual Representation
Authors:
Wen-Ji Zhou,
Yang Yu,
Yingfeng Chen,
Kai Guan,
Tangjie Lv,
Changjie Fan,
Zhi-Hua Zhou
Abstract:
Experience reuse is key to sample-efficient reinforcement learning. One of the critical issues is how the experience is represented and stored. Previously, experience has been stored in the form of features, individual models, and the average model, each lying at a different granularity. However, new tasks may require experience across multiple granularities. In this paper, we propose the policy residual representation (PRR) network, which can extract and store multiple levels of experience. The PRR network is trained on a set of tasks with a multi-level architecture, where a module in each level corresponds to a subset of the tasks. Therefore, the PRR network represents the experience in a spectrum-like way. When training on a new task, PRR can provide different levels of experience to accelerate learning. We experiment with the PRR network on a set of grid world navigation tasks, locomotion tasks, and fighting tasks in a video game. The results show that the PRR network leads to better reuse of experience and thus outperforms some state-of-the-art approaches.
Submitted 31 May, 2019;
originally announced May 2019.
-
Resource Allocation for Device-to-Device Communications Underlaying Heterogeneous Cellular Networks Using Coalitional Games
Authors:
Yali Chen,
Bo Ai,
Yong Niu,
Ke Guan,
Zhu Han
Abstract:
Heterogeneous cellular networks (HCNs) that include millimeter wave (mmWave) communications are emerging as a promising candidate for the fifth generation mobile network. With highly directional antenna arrays, mmWave links are able to provide transmission rates of several Gbps. However, mmWave links are easily blocked without line of sight. On the other hand, D2D communications have been proposed to support many content-based applications, and need to share resources with users in HCNs to improve spectral reuse and enhance system capacity. Consequently, an efficient resource allocation scheme for D2D pairs among both mmWave and the cellular carrier band is needed. In this paper, we first formulate the problem of resource allocation among mmWave and the cellular band for multiple D2D pairs from the viewpoint of game theory. Then, with the characteristics of cellular and mmWave communications considered, we propose a coalition formation game to maximize the system sum rate in a statistical average sense. We also theoretically prove that our proposed game converges to a Nash-stable equilibrium and further reaches the near-optimal solution with a fast convergence rate. Through extensive simulations under various system parameters, we demonstrate the superior performance of our scheme in terms of the system sum rate compared with several other practical schemes.
Submitted 6 November, 2018;
originally announced November 2018.
-
Intelligent Trainer for Model-Based Reinforcement Learning
Authors:
Yuanlong Li,
Linsen Dong,
Xin Zhou,
Yonggang Wen,
Kyle Guan
Abstract:
Model-based reinforcement learning (MBRL) has been proposed as a promising alternative for tackling the high sampling cost of canonical reinforcement learning (RL), by leveraging a learned model to generate synthesized data for policy training. The MBRL framework, nevertheless, is inherently limited by the convoluted process of jointly learning the control policy and configuring hyper-parameters (e.g., global/local models, real and synthesized data, etc.). The training process can be tedious and prohibitively costly. In this research, we propose a "reinforcement on reinforcement" (RoR) architecture to decompose the convoluted tasks into two layers of reinforcement learning. The inner layer is the canonical model-based RL training process environment (TPE), which learns the control policy for the underlying system and exposes interfaces to access states, actions and rewards. The outer layer presents an RL agent, called the AI trainer, to learn an optimal hyper-parameter configuration for the inner TPE. This decomposition approach provides desirable flexibility to implement different trainer designs, referred to as "train the trainer". In our research, we propose and optimize two alternative trainer designs: 1) a uni-head trainer and 2) a multi-head trainer. Our proposed RoR framework is evaluated on five tasks in the OpenAI gym (i.e., Pendulum, Mountain Car, Reacher, Half Cheetah and Swimmer). Compared to three other baseline algorithms, our proposed Train-the-Trainer algorithm has competitive performance in auto-tuning capability, with up to 56% expected sampling cost saving without knowing the best parameter setting in advance. The proposed trainer framework can be easily extended to other cases in which hyper-parameter tuning is costly.
Submitted 5 June, 2019; v1 submitted 23 May, 2018;
originally announced May 2018.
-
The asymmetric quantum Rabi model and generalised Pöschl-Teller potentials
Authors:
Kai-Long Guan,
Zi-Min Li,
Clare Dunning,
Murray T Batchelor
Abstract:
Starting with the Gaudin-like Bethe ansatz equations associated with the quasi-exactly solved (QES) exceptional points of the asymmetric quantum Rabi model (AQRM) a spectral equivalence is established with QES hyperbolic Schrödinger potentials on the line. This leads to particular QES Pöschl-Teller potentials. The complete spectral equivalence is then established between the AQRM and generalised Pöschl-Teller potentials. This result extends a previous mapping between the symmetric quantum Rabi model and a QES Pöschl-Teller potential. The complete spectral equivalence between the two systems suggests that the physics of the generalised Pöschl-Teller potentials may also be explored in experimental realisations of the quantum Rabi model.
Submitted 6 June, 2018; v1 submitted 11 April, 2018;
originally announced April 2018.
-
DeepQoE: A unified Framework for Learning to Predict Video QoE
Authors:
Huaizheng Zhang,
Han Hu,
Guanyu Gao,
Yonggang Wen,
Kyle Guan
Abstract:
Motivated by the prowess of deep learning (DL) based techniques in prediction, generalization, and representation learning, we develop a novel framework called DeepQoE to predict video quality of experience (QoE). The end-to-end framework first uses a combination of DL techniques (e.g., word embeddings) to extract generalized features. Next, these features are combined and fed into a neural network for representation learning. Such representations serve as inputs for classification or regression tasks. Evaluating the performance of DeepQoE with two datasets, we show that for the small dataset, the accuracy of all shallow learning algorithms is improved by using the representation derived from DeepQoE. For the large dataset, our DeepQoE framework achieves significant performance improvement in comparison to the best baseline method (90.94% vs. 82.84%). Moreover, DeepQoE, also released as an open source tool, provides video QoE research with much-needed flexibility in fitting different datasets, extracting generalized features, and learning representations.
Submitted 10 April, 2018;
originally announced April 2018.
-
Transforming Cooling Optimization for Green Data Center via Deep Reinforcement Learning
Authors:
Yuanlong Li,
Yonggang Wen,
Kyle Guan,
Dacheng Tao
Abstract:
The cooling system plays a critical role in a modern data center (DC). Developing an optimal control policy for a DC cooling system is a challenging task. The prevailing approaches often rely on approximate system models built upon knowledge of mechanical cooling, electrical and thermal management, which are difficult to design and may lead to sub-optimal or unstable performance. In this paper, we propose utilizing the large amount of monitoring data in the DC to optimize the control policy. To do so, we cast the cooling control policy design as an energy cost minimization problem with temperature constraints, and tap it into the emerging deep reinforcement learning (DRL) framework. Specifically, we propose an end-to-end cooling control algorithm (CCA) that is based on the actor-critic framework and an off-policy offline version of the deep deterministic policy gradient (DDPG) algorithm. In the proposed CCA, an evaluation network is trained to predict an energy cost counter penalized by the cooling status of the DC room, and a policy network is trained to predict optimized control settings when given the current load and weather information. The proposed algorithm is evaluated on the EnergyPlus simulation platform and on a real data trace collected from the National Super Computing Centre (NSCC) of Singapore. Our results show that the proposed CCA can achieve about 11% cooling cost saving on the simulation platform compared with a manually configured baseline control algorithm. In the trace-based study, we propose a de-underestimation (DUE) validation mechanism, as we cannot directly test the algorithm on a real DC. Even though the results with DUE are conservative, we can still achieve about 15% cooling energy saving on the NSCC data trace if we set the inlet temperature threshold at 26.6 degrees Celsius.
Submitted 18 July, 2018; v1 submitted 15 September, 2017;
originally announced September 2017.
-
A Fan Beam Model for Radio Pulsars. I. Observational Evidence
Authors:
Hong Guang Wang,
Fei Peng Pi,
Xiao Ping Zheng,
Chun Lan Deng,
Sai Qin Wen,
Feng Ye,
Kai Ying Guan,
Yi Liu,
Li Qing Xu
Abstract:
We propose a novel beam model for radio pulsars based on the scenario that the broadband and coherent emission from secondary relativistic particles, as they move along a flux tube in a dipolar magnetic field, forms a radially extended sub-beam with unique properties. The whole radio beam may consist of several sub-beams, forming a fan-shaped pattern. When only one or a few flux tubes are active, the fan beam becomes very patchy. This model differs essentially from the conal beam models in the respects of beam structure and predictions on the relationship between pulse width and impact angle $β$ (the angle between line of sight and magnetic pole) and the relationship between emission intensity and beam angular radius. The evidence for this model comes from the observed patchy beams of precessional binary pulsars and three statistical relationships found for a sample of 64 pulsars, of which $β$ were mostly constrained by fitting polarization position angle data with the Rotation Vector Model. With appropriate assumptions, the fan beam model can reproduce the relationship between 10% peak pulse width and $|β|$, the anticorrelation between the emission intensity and $|β|$, and the upper boundary line in the scatter plot of $|β|$ versus pulsar distance. An extremely patchy beam model with the assumption of narrowband emission from one or a few flux tubes is studied and found unlikely to be a general model. The implications of the fan beam model to the studies on radio and gamma-ray pulsar populations and radio polarization are discussed.
Submitted 27 May, 2014;
originally announced May 2014.
-
Generalized Floquet Exponent, Attractiveness Portrait and Structure Hidden in an Attractor
Authors:
Keying Guan
Abstract:
The generalized Floquet exponent and the attractiveness portrait (or A-portrait for short) of the attractor and of the smallest invariant closed set are proposed for the study of dynamical systems. Based on the A-portrait, some simple structures hidden in an attractor with complicated structure may emerge. The hidden structure plays an important role in the bifurcation phenomena of the invariant sets. Examples of A-portraits are given for the Van der Pol limit cycle, for the Lorenz attractor, for the closed limit orbits of different rotation numbers and complicated attractors of the Silnikov equation, and for three interlocked smallest invariant closed sets of the new improved Nosé-Hoover oscillator.
Submitted 6 March, 2014;
originally announced March 2014.
-
Important Notes on Lyapunov Exponents
Authors:
Keying Guan
Abstract:
It is shown that the famous Lyapunov exponents cannot be used as the numerical characteristic for distinguishing different kinds of attractors, such as the equilibrium point, the limit closed curve, the stable torus and the strange attractor.
Submitted 14 January, 2014;
originally announced January 2014.
-
Period-doubling cascades of a Silnikov equation
Authors:
Keying Guan,
Beiye Feng
Abstract:
Based on numerical results of a Silnikov equation, three period-doubling cascades, corresponding respectively to three different characters of the rotation number of a limit closed orbit, are studied, and the Feigenbaum constant is used successfully in the estimation of the critical parameter values for the period-doubling bifurcation. The conceptions of separaror and pseudo-attractor are also introduced in the discussion part of this paper.
Submitted 15 December, 2013; v1 submitted 6 December, 2013;
originally announced December 2013.
-
Non-trivial Local Attractors of a Three-dimensional Dynamical System
Authors:
Keying Guan
Abstract:
Based on both qualitative methods and numerical tests for a series of particular cases in the parameter region a=1, 0<b<1, it is shown that the three-dimensional system (2) may exhibit a series of interesting phenomena on the non-trivial local attractors, such as the faint attractor (this term is suggested by the author), the local attractor with complex structure, twin spatial limit closed orbits, the bifurcation of rotation numbers, and the spatial limit cycle, etc. The system (2) is a very rich source in the study of dynamical system theory.
Submitted 24 December, 2013; v1 submitted 24 November, 2013;
originally announced November 2013.
-
Rate-Distortion-Based Physical Layer Secrecy with Applications to Multimode Fiber
Authors:
Eva C. Song,
Emina Soljanin,
Paul Cuff,
H. Vincent Poor,
Kyle Guan
Abstract:
Optical networks are vulnerable to physical layer attacks; wiretappers can improperly receive messages intended for legitimate recipients. Our work considers an aspect of this security problem within the domain of multimode fiber (MMF) transmission. MMF transmission can be modeled via a broadcast channel in which both the legitimate receiver's and wiretapper's channels are multiple-input-multiple-output complex Gaussian channels. Source-channel coding analyses based on the use of distortion as the metric for secrecy are developed. Alice has a source sequence to be encoded and transmitted over this broadcast channel so that the legitimate user Bob can reliably decode while forcing the distortion of wiretapper, or eavesdropper, Eve's estimate as high as possible. Tradeoffs between transmission rate and distortion under two extreme scenarios are examined: the best case where Eve has only her channel output and the worst case where she also knows the past realization of the source. It is shown that under the best case, an operationally separate source-channel coding scheme guarantees maximum distortion at the same rate as needed for reliable transmission. Theoretical bounds are given, and particularized for MMF. Numerical results showing the rate distortion tradeoff are presented and compared with corresponding results for the perfect secrecy case.
Submitted 29 December, 2013; v1 submitted 15 April, 2013;
originally announced April 2013.