
arXiv:2409.20075 [pdf, other]

BSharedRAG: Backbone Shared Retrieval-Augmented Generation for the E-commerce Domain

Authors: Kaisi Guan, Qian Cao, Yuchong Sun, Xiting Wang, Ruihua Song

Abstract: Retrieval Augmented Generation (RAG) system is important in domains such as e-commerce, which has many long-tail entities and frequently updated information. Most existing works adopt separate modules for retrieval and generation, which may be suboptimal since the retrieval task and the generation task cannot benefit from each other to improve performance. We propose a novel Backbone Shared RAG framework (BSharedRAG). It first uses a domain-specific corpus to continually pre-train a base model as a domain-specific backbone model and then trains two plug-and-play Low-Rank Adaptation (LoRA) modules based on the shared backbone to minimize retrieval and generation losses respectively. Experimental results indicate that our proposed BSharedRAG outperforms baseline models by 5% and 13% in Hit@3 upon two datasets in retrieval evaluation and by 23% in terms of BLEU-3 in generation evaluation. Our codes, models, and dataset are available at https://bsharedrag.github.io.

Submitted 30 September, 2024; originally announced September 2024.

Comments: EMNLP 2024 findings
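
The Hit@3 figure quoted in the BSharedRAG abstract is a retrieval metric. As a rough illustration only, the sketch below computes Hit@k under the common definition (the fraction of queries for which at least one relevant item appears among the top k retrieved items). The abstract does not spell out the paper's exact evaluation protocol, so the function name `hit_at_k`, the argument layout, and the sample data are assumptions made for this example.

```python
from typing import Sequence, Set


def hit_at_k(
    ranked_ids: Sequence[Sequence[str]],
    relevant_ids: Sequence[Set[str]],
    k: int = 3,
) -> float:
    """Return the fraction of queries with a relevant item in the top-k results.

    ranked_ids[i] is the ranked list of retrieved document ids for query i;
    relevant_ids[i] is the set of ids judged relevant for query i.
    This follows the common Hit@k definition, which is an assumption here.
    """
    if len(ranked_ids) != len(relevant_ids):
        raise ValueError("ranked_ids and relevant_ids must have the same length")
    if not ranked_ids:
        return 0.0
    hits = sum(
        1
        for ranked, relevant in zip(ranked_ids, relevant_ids)
        if any(doc_id in relevant for doc_id in ranked[:k])
    )
    return hits / len(ranked_ids)


# Hypothetical data: the first query has a relevant item at rank 3,
# the second only at rank 4, so one of two queries counts as a hit.
example_rankings = [["a", "b", "c", "d"], ["x", "y", "z", "w"]]
example_relevant = [{"c"}, {"w"}]
example_score = hit_at_k(example_rankings, example_relevant, k=3)
```

With the hypothetical data above, one of the two queries is a hit at k = 3, so the score is 0.5.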

arXiv:2408.14158 [pdf, other]

Fire-Flyer AI-HPC: A Cost-Effective Software-Hardware Co-Design for Deep Learning

Authors: Wei An, Xiao Bi, Guanting Chen, Shanhuang Chen, Chengqi Deng, Honghui Ding, Kai Dong, Qiushi Du, Wenjun Gao, Kang Guan, Jianzhong Guo, Yongqiang Guo, Zhe Fu, Ying He, Panpan Huang, Jiashi Li, Wenfeng Liang, Xiaodong Liu, Xin Liu, Yiyuan Liu, Yuxuan Liu, Shanghao Lu, Xuan Lu, Xiaotao Nie, Tian Pei, et al. (27 additional authors not shown)

Abstract: The rapid progress in Deep Learning (DL) and Large Language Models (LLMs) has exponentially increased demands of computational power and bandwidth. This, combined with the high costs of faster computing chips and interconnects, has significantly inflated High Performance Computing (HPC) construction costs. To address these challenges, we introduce the Fire-Flyer AI-HPC architecture, a synergistic hardware-software co-design framework and its best practices. For DL training, we deployed the Fire-Flyer 2 with 10,000 PCIe A100 GPUs, achieved performance approximating the DGX-A100 while reducing costs by half and energy consumption by 40%. We specifically engineered HFReduce to accelerate allreduce communication and implemented numerous measures to keep our Computation-Storage Integrated Network congestion-free. Through our software stack, including HaiScale, 3FS, and HAI-Platform, we achieved substantial scalability by overlapping computation and communication. Our system-oriented experience from DL training provides valuable insights to drive future advancements in AI-HPC.

Submitted 31 August, 2024; v1 submitted 26 August, 2024; originally announced August 2024.

Comments: This is the preprint version of the paper accepted for presentation at the 2024 International Conference for High Performance Computing, Networking, Storage, and Analysis (SC'24). © 2024 IEEE. Personal use of this material is permitted. For other uses, permission from IEEE must be obtained. Please refer to IEEE Xplore for the final published version

arXiv:2406.11931 [pdf, other]

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

Authors: DeepSeek-AI, Qihao Zhu, Daya Guo, Zhihong Shao, Dejian Yang, Peiyi Wang, Runxin Xu, Y. Wu, Yukun Li, Huazuo Gao, Shirong Ma, Wangding Zeng, Xiao Bi, Zihui Gu, Hanwei Xu, Damai Dai, Kai Dong, Liyue Zhang, Yishi Piao, Zhibin Gou, Zhenda Xie, Zhewen Hao, Bingxuan Wang, Junxiao Song, Deli Chen, et al. (15 additional authors not shown)

Abstract: We present DeepSeek-Coder-V2, an open-source Mixture-of-Experts (MoE) code language model that achieves performance comparable to GPT4-Turbo in code-specific tasks. Specifically, DeepSeek-Coder-V2 is further pre-trained from an intermediate checkpoint of DeepSeek-V2 with additional 6 trillion tokens. Through this continued pre-training, DeepSeek-Coder-V2 substantially enhances the coding and mathematical reasoning capabilities of DeepSeek-V2, while maintaining comparable performance in general language tasks. Compared to DeepSeek-Coder-33B, DeepSeek-Coder-V2 demonstrates significant advancements in various aspects of code-related tasks, as well as reasoning and general capabilities. Additionally, DeepSeek-Coder-V2 expands its support for programming languages from 86 to 338, while extending the context length from 16K to 128K. In standard benchmark evaluations, DeepSeek-Coder-V2 achieves superior performance compared to closed-source models such as GPT4-Turbo, Claude 3 Opus, and Gemini 1.5 Pro in coding and math benchmarks.

Submitted 17 June, 2024; originally announced June 2024.

arXiv:2405.04434 [pdf, other]

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

Authors: DeepSeek-AI, Aixin Liu, Bei Feng, Bin Wang, Bingxuan Wang, Bo Liu, Chenggang Zhao, Chengqi Deng, Chong Ruan, Damai Dai, Daya Guo, Dejian Yang, Deli Chen, Dongjie Ji, Erhang Li, Fangyun Lin, Fuli Luo, Guangbo Hao, Guanting Chen, Guowei Li, H. Zhang, Hanwei Xu, Hao Yang, Haowei Zhang, Honghui Ding, et al. (132 additional authors not shown)

Abstract: We present DeepSeek-V2, a strong Mixture-of-Experts (MoE) language model characterized by economical training and efficient inference. It comprises 236B total parameters, of which 21B are activated for each token, and supports a context length of 128K tokens. DeepSeek-V2 adopts innovative architectures including Multi-head Latent Attention (MLA) and DeepSeekMoE. MLA guarantees efficient inference through significantly compressing the Key-Value (KV) cache into a latent vector, while DeepSeekMoE enables training strong models at an economical cost through sparse computation. Compared with DeepSeek 67B, DeepSeek-V2 achieves significantly stronger performance, and meanwhile saves 42.5% of training costs, reduces the KV cache by 93.3%, and boosts the maximum generation throughput to 5.76 times. We pretrain DeepSeek-V2 on a high-quality and multi-source corpus consisting of 8.1T tokens, and further perform Supervised Fine-Tuning (SFT) and Reinforcement Learning (RL) to fully unlock its potential. Evaluation results show that, even with only 21B activated parameters, DeepSeek-V2 and its chat versions still achieve top-tier performance among open-source models.

Submitted 19 June, 2024; v1 submitted 7 May, 2024; originally announced May 2024.
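
To put the headline numbers in the DeepSeek-V2 abstract in proportion, the short calculation below derives two ratios from figures stated there: the share of parameters activated per token (21B of 236B) and the share of the KV cache that remains after a 93.3% reduction. The variable names are illustrative, and the calculation is plain arithmetic on the quoted figures rather than anything taken from the paper's code.

```python
# Figures quoted in the abstract (in billions of parameters).
TOTAL_PARAMS_B = 236.0
ACTIVE_PARAMS_B = 21.0

# KV cache reduction relative to DeepSeek 67B, as quoted in the abstract.
KV_CACHE_REDUCTION = 0.933

# Share of parameters activated for each token.
active_fraction = ACTIVE_PARAMS_B / TOTAL_PARAMS_B

# Share of the original KV cache that remains after the reduction.
kv_cache_remaining = 1.0 - KV_CACHE_REDUCTION

print(f"activated per token: {active_fraction:.1%}")
print(f"KV cache remaining:  {kv_cache_remaining:.1%}")
```

Under these figures, roughly 9% of the parameters are active for any given token, and the KV cache shrinks to about 7% of its former size.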

arXiv:2404.04242 [pdf, other]

Physical Property Understanding from Language-Embedded Feature Fields

Authors: Albert J. Zhai, Yuan Shen, Emily Y. Chen, Gloria X. Wang, Xinlei Wang, Sheng Wang, Kaiyu Guan, Shenlong Wang

Abstract: Can computers perceive the physical properties of objects solely through vision? Research in cognitive science and vision science has shown that humans excel at identifying materials and estimating their physical properties based purely on visual appearance. In this paper, we present a novel approach for dense prediction of the physical properties of objects using a collection of images. Inspired by how humans reason about physics through vision, we leverage large language models to propose candidate materials for each object. We then construct a language-embedded point cloud and estimate the physical properties of each 3D point using a zero-shot kernel regression approach. Our method is accurate, annotation-free, and applicable to any object in the open world. Experiments demonstrate the effectiveness of the proposed approach in various physical property reasoning tasks, such as estimating the mass of common objects, as well as other properties like friction and hardness.

Submitted 5 April, 2024; originally announced April 2024.

Comments: CVPR 2024. Project page (with code): https://ajzhai.github.io/NeRF2Physics/

arXiv:2402.12685 [pdf, other]

XRL-Bench: A Benchmark for Evaluating and Comparing Explainable Reinforcement Learning Techniques

Authors: Yu Xiong, Zhipeng Hu, Ye Huang, Runze Wu, Kai Guan, Xingchen Fang, Ji Jiang, Tianze Zhou, Yujing Hu, Haoyu Liu, Tangjie Lyu, Changjie Fan

Abstract: Reinforcement Learning (RL) has demonstrated substantial potential across diverse fields, yet understanding its decision-making process, especially in real-world scenarios where rationality and safety are paramount, is an ongoing challenge. This paper delves into Explainable RL (XRL), a subfield of Explainable AI (XAI) aimed at unravelling the complexities of RL models. Our focus rests on state-explaining techniques, a crucial subset within XRL methods, as they reveal the underlying factors influencing an agent's actions at any given time. Despite their significant role, the lack of a unified evaluation framework hinders assessment of their accuracy and effectiveness. To address this, we introduce XRL-Bench, a unified standardized benchmark tailored for the evaluation and comparison of XRL methods, encompassing three main modules: standard RL environments, explainers based on state importance, and standard evaluators. XRL-Bench supports both tabular and image data for state explanation. We also propose TabularSHAP, an innovative and competitive XRL method. We demonstrate the practical utility of TabularSHAP in real-world online gaming services and offer an open-source benchmark platform for the straightforward implementation and evaluation of XRL methods. Our contributions facilitate the continued progression of XRL technology.

Submitted 19 February, 2024; originally announced February 2024.

Comments: 10 pages, 5 figures

arXiv:2401.05236 [pdf, other]

Structure from Duplicates: Neural Inverse Graphics from a Pile of Objects

Authors: Tianhang Cheng, Wei-Chiu Ma, Kaiyu Guan, Antonio Torralba, Shenlong Wang

Abstract: Our world is full of identical objects (e.g., cans of coke, cars of same model). These duplicates, when seen together, provide additional and strong cues for us to effectively reason about 3D. Inspired by this observation, we introduce Structure from Duplicates (SfD), a novel inverse graphics framework that reconstructs geometry, material, and illumination from a single image containing multiple identical objects. SfD begins by identifying multiple instances of an object within an image, and then jointly estimates the 6DoF pose for all instances. An inverse graphics pipeline is subsequently employed to jointly reason about the shape, material of the object, and the environment light, while adhering to the shared geometry and material constraint across instances. Our primary contributions involve utilizing object duplicates as a robust prior for single-image inverse graphics and proposing an in-plane rotation-robust Structure from Motion (SfM) formulation for joint 6-DoF object pose estimation. By leveraging multi-view cues from a single image, SfD generates more realistic and detailed 3D reconstructions, significantly outperforming existing single image reconstruction models and multi-view reconstruction approaches with a similar or greater number of observations.

Submitted 10 January, 2024; originally announced January 2024.

Comments: Code: https://github.com/Tianhang-Cheng/SfD

arXiv:2401.02954 [pdf, other]

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

Authors: DeepSeek-AI, Xiao Bi, Deli Chen, Guanting Chen, Shanhuang Chen, Damai Dai, Chengqi Deng, Honghui Ding, Kai Dong, Qiushi Du, Zhe Fu, Huazuo Gao, Kaige Gao, Wenjun Gao, Ruiqi Ge, Kang Guan, Daya Guo, Jianzhong Guo, Guangbo Hao, Zhewen Hao, Ying He, Wenjie Hu, Panpan Huang, Erhang Li, et al. (63 additional authors not shown)

Abstract: The rapid development of open-source large language models (LLMs) has been truly remarkable. However, the scaling law described in previous literature presents varying conclusions, which casts a dark cloud over scaling LLMs. We delve into the study of scaling laws and present our distinctive findings that facilitate scaling of large scale models in two commonly used open-source configurations, 7B and 67B. Guided by the scaling laws, we introduce DeepSeek LLM, a project dedicated to advancing open-source language models with a long-term perspective. To support the pre-training phase, we have developed a dataset that currently consists of 2 trillion tokens and is continuously expanding. We further conduct supervised fine-tuning (SFT) and Direct Preference Optimization (DPO) on DeepSeek LLM Base models, resulting in the creation of DeepSeek Chat models. Our evaluation results demonstrate that DeepSeek LLM 67B surpasses LLaMA-2 70B on various benchmarks, particularly in the domains of code, mathematics, and reasoning. Furthermore, open-ended evaluations reveal that DeepSeek LLM 67B Chat exhibits superior performance compared to GPT-3.5.

Submitted 5 January, 2024; originally announced January 2024.

arXiv:2312.13790 [pdf, other]

From Past to Future: Digital Methods Towards Artefact Analysis

Authors: Andrew Harris, Andrea Cremaschi, Tse Siang Lim, Maria De Iorio, Kwa Chong Guan

Abstract: Over the past two decades, Digital Humanities has transformed the landscape of humanities and social sciences, enabling advanced computational analysis and interpretation of extensive datasets. Notably, recent initiatives in Southeast Asia, particularly in Singapore, focus on categorising and archiving historical data such as artwork, literature and, most notably, archaeological artefacts. This study illustrates the profound potential of Digital Humanities through the application of statistical methods on two distinct artefact datasets. Specifically, we present the results of an automated die study on mid-1st millennium AD "Rising Sun" coinage from mainland Southeast Asia, while subsequently utilising unsupervised statistical methods on 2D images of 13th-14th century earthenware ceramics excavated from the precolonial St. Andrew's Cathedral site in central Singapore. This research offers a comparative assessment showcasing the transformative impact of statistics-based approaches on the interpretation and analysis of diverse archaeological materials and within Digital Humanities overall.

Submitted 21 December, 2023; originally announced December 2023.

arXiv:2305.18612 [pdf, other]

doi 10.1145/3580305.3599444

Networked Time Series Imputation via Position-aware Graph Enhanced Variational Autoencoders

Authors: Dingsu Wang, Yuchen Yan, Ruizhong Qiu, Yada Zhu, Kaiyu Guan, Andrew J Margenot, Hanghang Tong

Abstract: Multivariate time series (MTS) imputation is a widely studied problem in recent years. Existing methods can be divided into two main groups, including (1) deep recurrent or generative models that primarily focus on time series features, and (2) graph neural networks (GNNs) based models that utilize the topological information from the inherent graph structure of MTS as relational inductive bias for imputation. Nevertheless, these methods either neglect topological information or assume the graph structure is fixed and accurately known. Thus, they fail to fully utilize the graph dynamics for precise imputation in more challenging MTS data such as networked time series (NTS), where the underlying graph is constantly changing and might have missing edges. In this paper, we propose a novel approach to overcome these limitations. First, we define the problem of imputation over NTS which contains missing values in both node time series features and graph structures. Then, we design a new model named PoGeVon which leverages variational autoencoder (VAE) to predict missing values over both node time series features and graph structures. In particular, we propose a new node position embedding based on random walk with restart (RWR) in the encoder with provable higher expressive power compared with message-passing based graph neural networks (GNNs). We further design a decoder with 3-stage predictions from the perspective of multi-task learning to impute missing values in both time series and graph structures reciprocally. Experiment results demonstrate the effectiveness of our model over baselines.

Submitted 26 June, 2023; v1 submitted 29 May, 2023; originally announced May 2023.

Comments: KDD 2023

arXiv:2305.15184 [pdf, other]

doi 10.1109/TITS.2024.3362515

6G Enabled Advanced Transportation Systems

Authors: Ruiqi Liu, Meng Hua, Ke Guan, Xiping Wang, Leyi Zhang, Tianqi Mao, Di Zhang, Qingqing Wu, Abbas Jamalipour

Abstract: With the emergence of communication services with stringent requirements such as autonomous driving or on-flight Internet, the sixth-generation (6G) wireless network is envisaged to become an enabling technology for future transportation systems. In this paper, two ways of interactions between 6G networks and transportation are extensively investigated. On one hand, the new usage scenarios and capabilities of 6G over existing cellular networks are firstly highlighted. Then, its potential in seamless and ubiquitous connectivity across the heterogeneous space-air-ground transportation systems is demonstrated, where railways, airplanes, high-altitude platforms and satellites are investigated. On the other hand, we reveal that the introduction of 6G guarantees a more intelligent, efficient and secure transportation system. Specifically, technical analysis on how 6G can empower future transportation is provided, based on the latest research and standardization progresses in localization, integrated sensing and communications, and security. The technical challenges and insights for a road ahead are also summarized for possible inspirations on 6G enabled advanced transportation.

Submitted 11 December, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

Comments: Submitted to IEEE Transactions on Intelligent Transportation Systems (T-ITS)

Journal ref: IEEE Transactions on Intelligent Transportation Systems (2024) 1-17

arXiv:2305.03704 [pdf, other]

A 3D Modeling Method for Scattering on Rough Surfaces at the Terahertz Band

Authors: Ben Chen, Ke Guan, Danping He, Pengxiang Xie, Zhangdui Zhong, Jianwu Dou, Shahid Mumtaz, Wael Bazzi

Abstract: The terahertz (THz) band (0.1-10 THz) is widely considered to be a candidate band for the sixth-generation mobile communication technology (6G). However, due to its short wavelength (less than 1 mm), scattering becomes a particularly significant propagation mechanism. In previous studies, we proposed a scattering model to characterize the scattering in THz bands, which can only reconstruct the scattering in the incidence plane. In this paper, a three-dimensional (3D) stochastic model is proposed to characterize the THz scattering on rough surfaces. Then, we reconstruct the scattering on rough surfaces with different shapes and under different incidence angles utilizing the proposed model. Good agreements can be achieved between the proposed model and full-wave simulation results. This stochastic 3D scattering model can be integrated into the standard channel modeling framework to realize more realistic THz channel data for the evaluation of 6G.

Submitted 5 May, 2023; originally announced May 2023.

arXiv:2303.13512 [pdf, other]

Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition

Authors: Stephanie Milani, Anssi Kanervisto, Karolis Ramanauskas, Sander Schulhoff, Brandon Houghton, Sharada Mohanty, Byron Galbraith, Ke Chen, Yan Song, Tianze Zhou, Bingquan Yu, He Liu, Kai Guan, Yujing Hu, Tangjie Lv, Federico Malato, Florian Leopold, Amogh Raut, Ville Hautamäki, Andrew Melnik, Shu Ishida, João F. Henriques, Robert Klassert, Walter Laurito, Ellen Novoseller, et al. (5 additional authors not shown)

Abstract: To facilitate research in the direction of fine-tuning foundation models from human feedback, we held the MineRL BASALT Competition on Fine-Tuning from Human Feedback at NeurIPS 2022. The BASALT challenge asks teams to compete to develop algorithms to solve tasks with hard-to-specify reward functions in Minecraft. Through this competition, we aimed to promote the development of algorithms that use human feedback as channels to learn the desired behavior. We describe the competition and provide an overview of the top solutions. We conclude by discussing the impact of the competition and future directions for improvement.

Submitted 23 March, 2023; originally announced March 2023.

arXiv:2301.11557 [pdf, other]

A Ray-tracing and Deep Learning Fusion Super-resolution Modeling Method for Wireless Mobile Channel

Authors: Zhao Zhang, Danping He, Xiping Wang, Ke Guan, Zhangdui Zhong, Jianwu Dou

Abstract: Mobile channel modeling has always been the core part for design, deployment and optimization of communication system, especially in 5G and beyond era. Deterministic channel modeling could precisely achieve mobile channel description, however with defects of equipment and time consuming. In this paper, we proposed a novel super resolution (SR) model for cluster characteristics prediction. The model is based on deep neural networks with residual connection. A series of simulations at 3.5 GHz are conducted by a three-dimensional ray tracing (RT) simulator in diverse scenarios. Cluster characteristics are extracted and corresponding data sets are constructed to train the model. Experiments demonstrate that the proposed SR approach could achieve better power and cluster location prediction performance than traditional interpolation method and the root mean square error (RMSE) drops by 51% and 78% relatively. Channel impulse response (CIR) is reconstructed based on cluster characteristics, which could match well with the multi-path component (MPC). The proposed method can be used to efficiently and accurately generate big data of mobile channel, which significantly reduces the computation time of RT-only.

Submitted 27 January, 2023; originally announced January 2023.

Comments: 5 pages, 7 figures, accepted by EuCAP 2023

arXiv:2301.04479 [pdf, other]

Super-resolution of Ray-tracing Channel Simulation via Attention Mechanism based Deep Learning Model

Authors: Haoyang Zhang, Danping He, Xiping Wang, Wenbin Wang, Yunhao Cheng, Ke Guan

Abstract: As an emerging approach, deep learning plays an increasingly influential role in channel modeling. Traditional ray tracing (RT) methods of channel modeling tend to be inefficient and expensive. In this paper, we present a super-resolution (SR) model for channel characteristics. Residual connection and attention mechanism are applied to this convolutional neural network (CNN) model. Experiments prove that the proposed model can reduce the noise interference generated in the SR process and solve the problem of low efficiency of RT. The mean absolute error of our channel SR model on the PL achieves the effect of 2.82 dB with scale factor 2, the same accuracy as RT took only 52% of the time in theory. Compared with vision transformer (ViT), the proposed model also demonstrates less running time and computing cost in SR of channel characteristics.

Submitted 21 January, 2023; v1 submitted 11 January, 2023; originally announced January 2023.

arXiv:2209.13525 [pdf, other]

Retrieval Based Time Series Forecasting

Authors: Baoyu Jing, Si Zhang, Yada Zhu, Bin Peng, Kaiyu Guan, Andrew Margenot, Hanghang Tong

Abstract: Time series data appears in a variety of applications such as smart transportation and environmental monitoring. One of the fundamental problems for time series analysis is time series forecasting. Despite the success of recent deep time series forecasting methods, they require sufficient observation of historical values to make accurate forecasting. In other words, the ratio of the output length (or forecasting horizon) to the sum of the input and output lengths should be low enough (e.g., 0.3). As the ratio increases (e.g., to 0.8), the uncertainty for the forecasting accuracy increases significantly. In this paper, we show both theoretically and empirically that the uncertainty could be effectively reduced by retrieving relevant time series as references. In the theoretical analysis, we first quantify the uncertainty and show its connections to the Mean Squared Error (MSE). Then we prove that models with references are easier to learn than models without references since the retrieved references could reduce the uncertainty. To empirically demonstrate the effectiveness of the retrieval based time series forecasting models, we introduce a simple yet effective two-stage method, called ReTime consisting of a relational retrieval and a content synthesis. We also show that ReTime can be easily adapted to the spatial-temporal time series and time series imputation settings. Finally, we evaluate ReTime on real-world datasets to demonstrate its effectiveness.

Submitted 27 September, 2022; originally announced September 2022.

Comments: CIKM'22 AMLTS
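
The ratio mentioned in the ReTime abstract (output length divided by the sum of input and output lengths) is easy to make concrete. The sketch below is only an illustration of that definition. The function name `horizon_ratio` and the sample window lengths are assumptions chosen to reproduce the 0.3 and 0.8 values used as examples in the abstract.

```python
def horizon_ratio(input_len: int, output_len: int) -> float:
    """Ratio of the forecasting horizon to the total window length.

    Computes output_len / (input_len + output_len), following the
    definition given in the abstract.
    """
    if input_len < 0 or output_len < 0:
        raise ValueError("lengths must be non-negative")
    total = input_len + output_len
    if total == 0:
        raise ValueError("at least one of the lengths must be positive")
    return output_len / total


# Hypothetical windows: 70 observed steps and 30 forecast steps give the
# low-ratio regime; 20 observed and 80 forecast give the high-ratio regime.
low_ratio = horizon_ratio(input_len=70, output_len=30)
high_ratio = horizon_ratio(input_len=20, output_len=80)
```

In the first hypothetical window the ratio is 0.3; in the second it is 0.8, the regime in which the abstract says forecasting uncertainty grows significantly.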

arXiv:2209.04207 [pdf, other]

A Multi-Task Learning Model for Super Resolution of Wireless Channel Characteristics

Authors: Xiping Wang, Zhao Zhang, Danping He, Ke Guan, Dongliang Liu, Jianwu Dou

Abstract: Channel modeling has always been the core part in communication system design and development, especially in 5G and 6G era. Traditional approaches like stochastic channel modeling and ray-tracing (RT) based channel modeling depend heavily on measurement data or simulation, which are usually expensive and time consuming. In this paper, we propose a novel super resolution (SR) model for generating channel characteristics data. The model is based on multi-task learning (MTL) convolutional neural networks (CNN) with residual connection. Experiments demonstrate that the proposed SR model could achieve excellent performances in mean absolute error and standard deviation of error. Advantages of the proposed model are demonstrated in comparisons with other state-of-the-art deep learning models. Ablation study also proved the necessity of multi-task learning and techniques in model design. The contribution in this paper could be helpful in channel modeling, network optimization, positioning and other wireless channel characteristics related work by largely reducing workload of simulation or measurement.

Submitted 9 September, 2022; originally announced September 2022.

Comments: 6 pages, GLOBECOM 2022 CQRM accepted. Thanks haoyang for his help in uploading :)

arXiv:2201.05261 [pdf]

Adaptive Transfer Learning for Plant Phenotyping

Authors: Jun Wu, Elizabeth A. Ainsworth, Sheng Wang, Kaiyu Guan, Jingrui He

Abstract: Plant phenotyping (Guo et al. 2021; Pieruschka et al. 2019) focuses on studying the diverse traits of plants related to the plants' growth. To be more specific, by accurately measuring the plant's anatomical, ontogenetical, physiological and biochemical properties, it allows identifying the crucial factors of plants' growth in different environments. One commonly used approach is to predict the plant's traits using hyperspectral reflectance (Yendrek et al. 2017; Wang et al. 2021). However, the data distributions of the hyperspectral reflectance data in plant phenotyping might vary in different environments for different plants. That is, it would be computationally expensive to learn the machine learning models separately for one plant in different environments. To solve this problem, we focus on studying the knowledge transferability of modern machine learning models in plant phenotyping. More specifically, this work aims to answer the following questions. (1) How is the performance of conventional machine learning models, e.g., partial least squares regression (PLSR), Gaussian process regression (GPR) and multi-layer perceptron (MLP), affected by the number of annotated samples for plant phenotyping? (2) Could neural network based transfer learning models improve the performance of plant phenotyping? (3) Could neural network based transfer learning be improved by using infinite-width hidden layers for plant phenotyping?

Submitted 13 January, 2022; originally announced January 2022.

arXiv:2201.02834 [pdf, other]

Reconfigurable Intelligent Surface Enabled Spatial Multiplexing with Fully Convolutional Network

Authors: Bile Peng, Jan-Aike Termöhlen, Cong Sun, Danping He, Ke Guan, Tim Fingscheidt, Eduard A. Jorswieck

Abstract: Reconfigurable intelligent surface (RIS) is an emerging technology for future wireless communication systems. In this work, we consider downlink spatial multiplexing enabled by the RIS for weighted sum-rate (WSR) maximization. In the literature, most solutions use alternating gradient-based optimization, which has moderate performance, high complexity, and limited scalability. We propose to solve this problem with a fully convolutional network (FCN), which was originally designed for semantic segmentation of images. The rectangular shape of the RIS and the spatial correlation of channels with adjacent RIS antennas due to the short distance between them encourage us to apply it for the RIS configuration. We design a set of channel features that includes both cascaded channels via the RIS and the direct channel. In the base station (BS), the differentiable minimum mean squared error (MMSE) precoder is used for pretraining and the weighted minimum mean squared error (WMMSE) precoder is then applied for fine-tuning, which is nondifferentiable, more complex, but achieves a better performance. Evaluation results show that the proposed solution has higher performance and allows for a faster evaluation than the baselines. Hence it scales better to a large number of antennas, advancing the RIS one step closer to practical deployment.

Submitted 21 September, 2022; v1 submitted 8 January, 2022; originally announced January 2022.

arXiv:2112.07893 [pdf, other]

Graph-based Ensemble Machine Learning for Student Performance Prediction

Authors: Yinkai Wang, Aowei Ding, Kaiyi Guan, Shixi Wu, Yuanqi Du

Abstract: Student performance prediction is a critical research problem to understand the students' needs, present proper learning opportunities/resources, and improve teaching quality. However, traditional machine learning methods fail to produce stable and accurate prediction results. In this paper, we propose a graph-based ensemble machine learning method that aims to improve the stability of single machine learning methods via the consensus of multiple methods. To be specific, we leverage both supervised prediction methods and unsupervised clustering methods, and build an iterative approach that propagates in a bipartite graph and converges to more stable and accurate prediction results. Extensive experiments demonstrate the effectiveness of our proposed method in predicting more accurate student performance. Specifically, our model outperforms the best traditional machine learning algorithms by up to 14.8% in prediction accuracy.

Submitted 21 December, 2021; v1 submitted 15 December, 2021; originally announced December 2021.

Comments: 5 pages, 3 figures and 3 tables

arXiv:2109.02762 [pdf, other]

STRIVE: Scene Text Replacement In Videos

Authors: Vijay Kumar B G, Jeyasri Subramanian, Varnith Chordia, Eugene Bart, Shaobo Fang, Kelly Guan, Raja Bala

Abstract: We propose replacing scene text in videos using deep style transfer and learned photometric transformations. Building on recent progress on still image text replacement, we present extensions that alter text while preserving the appearance and motion characteristics of the original video. Compared to the problem of still image text replacement, our method addresses additional challenges introduced by video, namely effects induced by changing lighting, motion blur, diverse variations in camera-object pose over time, and preservation of temporal consistency. We parse the problem into three steps. First, the text in all frames is normalized to a frontal pose using a spatio-temporal transformer network. Second, the text is replaced in a single reference frame using a state-of-the-art still-image text replacement method. Finally, the new text is transferred from the reference to remaining frames using a novel learned image transformation network that captures lighting and blur effects in a temporally consistent manner. Results on synthetic and challenging real videos show realistic text transfer, competitive quantitative and qualitative performance, and superior inference speed relative to alternatives. We introduce new synthetic and real-world datasets with paired text objects. To the best of our knowledge this is the first attempt at deep video text replacement.

Submitted 6 September, 2021; originally announced September 2021.

Comments: ICCV 2021, Project Page: https://striveiccv2021.github.io/STRIVE-ICCV2021/

arXiv:2108.11902 [pdf, other]

Cluster-based Characterization and Modeling for UAV Air-to-Ground Time-Varying Channels

Authors: Zhuangzhuang Cui, Ke Guan, Claude Oestges, César Briso-Rodríguez, Bo Ai, Zhangdui Zhong

Abstract: With the deep integration between the unmanned aerial vehicle (UAV) and wireless communication, UAV-based air-to-ground (AG) propagation channels need more detailed descriptions and accurate models. In this paper, we aim to perform cluster-based characterization and modeling for AG channels. To the best of our knowledge, this is the first study that concentrates on the clustering and tracking of multipath components (MPCs) for time-varying AG channels. Based on measurement data at 6.5 GHz with 500 MHz of bandwidth, we first estimate potential MPCs utilizing the space-alternating generalized expectation-maximization (SAGE) algorithm. Then, we cluster the extracted MPCs considering their static and dynamic characteristics by employing the K-Power-Means (KPM) algorithm under the multipath component distance (MCD) measure. For characterizing time-variant clusters, we exploit a clustering-based tracking (CBT) method, which efficiently quantifies the survival lengths of clusters. Ultimately, we establish a cluster-based channel model, and validations illustrate the accuracy of the proposed model. This work not only promotes a better understanding of AG propagation channels but also provides a general cluster-based AG channel model with certain extensibility.

Submitted 26 August, 2021; originally announced August 2021.

arXiv:2105.13717 [pdf, other]

doi 10.1109/GLOBECOM46510.2021.9685078

Coverage Analysis of Cellular-Connected UAV Communications with 3GPP Antenna and Channel Models

Authors: Zhuangzhuang Cui, Ke Guan, İsmail Güvenç, Claude Oestges, Zhangdui Zhong

Abstract: For reliable and efficient communications of aerial platforms, such as unmanned aerial vehicles (UAVs), the cellular network is envisioned to provide connectivity for the aerial and ground user equipment (GUE) simultaneously, which brings challenges to the existing pattern of the base station (BS) tailored for ground-level services. Thus, we focus on the coverage probability analysis to investigate the coexistence of aerial and terrestrial users, by employing realistic antenna and channel models reported in the 3rd Generation Partnership Project (3GPP). The homogeneous Poisson point process (PPP) is used to describe the BS distribution, and the BS antenna is adjustable in the down-tilted angle and the number of the antenna array. Meanwhile, omnidirectional antennas are used for cellular users. We first derive the approximation of coverage probability and then conduct numerous simulations to evaluate the impacts of antenna numbers, down-tilted angles, carrier frequencies, and user heights. One of the essential findings indicates that the coverage probabilities of high-altitude users become less sensitive to the down-tilted angle. Moreover, we found that the aerial user equipment (AUE) in a certain range of heights can achieve the same or better coverage probability than that of GUE, which provides an insight into the effective deployment of cellular-connected aerial communications.

Submitted 28 May, 2021; originally announced May 2021.

arXiv:2102.03012 [pdf, other]

A Serverless Cloud-Fog Platform for DNN-Based Video Analytics with Incremental Learning

Authors: Huaizheng Zhang, Meng Shen, Yizheng Huang, Yonggang Wen, Yong Luo, Guanyu Gao, Kyle Guan

Abstract: DNN-based video analytics have empowered many new applications (e.g., automated retail). Meanwhile, the proliferation of fog devices provides developers with more design options to improve performance and save cost. To the best of our knowledge, this paper presents the first serverless system that takes full advantage of the client-fog-cloud synergy to better serve the DNN-based video analytics. Specifically, the system aims to achieve two goals: 1) Provide the optimal analytics results under the constraints of lower bandwidth usage and shorter round-trip time (RTT) by judiciously managing the computational and bandwidth resources deployed in the client, fog, and cloud environment. 2) Free developers from tedious administration and operation tasks, including DNN deployment, cloud and fog's resource management. To this end, we implement a holistic cloud-fog system referred to as VPaaS (Video-Platform-as-a-Service). VPaaS adopts serverless computing to enable developers to build a video analytics pipeline by simply programming a set of functions (e.g., model inference), which are then orchestrated to process videos through carefully designed modules. To save bandwidth and reduce RTT, VPaaS provides a new video streaming protocol that only sends low-quality video to the cloud. The state-of-the-art (SOTA) DNNs deployed at the cloud can identify regions of video frames that need further processing at the fog ends. At the fog ends, misidentified labels in these regions can be corrected using a light-weight DNN model. To address the data drift issues, we incorporate limited human feedback into the system to verify the results and adopt incremental learning to improve our system continuously. The evaluation demonstrates that VPaaS is superior to several SOTA systems: it maintains high accuracy while reducing bandwidth usage by up to 21%, RTT by up to 62.5%, and cloud monetary cost by up to 50%.

Submitted 5 February, 2021; originally announced February 2021.

Comments: 11 pages, 16 figures

arXiv:2012.06707 [pdf, other]

Channel Modeling for UAV Communications: State of the Art, Case Studies, and Future Directions

Authors: Zhuangzhuang Cui, Ke Guan, César Briso-Rodríguez, Bo Ai, Zhangdui Zhong, Claude Oestges

Abstract: As essential aerial platforms, unmanned aerial vehicles (UAVs) play an increasingly important role in broad wireless connectivity and high-data-rate transmission for future communication systems. Notably, various communication scenarios are involved in UAV communications, such as intercommunications between UAVs and communications with the ground user equipment, the cellular base station, and the ground station, to name a few. However, existing works mostly focus on a single communication scenario, a designated channel type, and a specific operating frequency, thus urgently requiring a comprehensive understanding of multi-scenario, multi-frequency, and multi-type UAV channels. This article focuses on the essentials of corresponding air-to-air (A2A) and air-to-ground (A2G) channels in UAV communications. We first identify the latest key challenges of channel modeling for UAV communications. We then provide the state of the art for A2A and A2G channel properties and models based on extensive measurement campaigns. In particular, we conduct realistic case studies to further demonstrate critical channel characterizations and machine learning-based modeling methods. Last but not least, potential directions are widely discussed for paving the way towards more accurate and effective channel models for UAV communications.

Submitted 16 April, 2021; v1 submitted 11 December, 2020; originally announced December 2020.

arXiv:2012.03171 [pdf, other]

doi 10.1109/TVT.2021.3063408

Coverage Probability Analysis of IRS-Aided Communication Systems

Authors: Zhuangzhuang Cui, Ke Guan, Jiayi Zhang, Zhangdui Zhong

Abstract: The intelligent reflective surface (IRS) technology has received much interest in recent years, thanks to its potential uses in future wireless communications, in which one of the promising use cases is to widen coverage, especially in the line-of-sight-blocked scenarios. Therefore, it is critical to analyze the corresponding coverage probability of IRS-aided communication systems. To the best of our knowledge, however, previous works focusing on this issue are very limited. In this paper, we analyze the coverage probability under the Rayleigh fading channel, taking the number and size of the array elements into consideration. We first derive the exact closed-form of coverage probability for the unit element. Afterward, with the method of moment matching, the approximation of the coverage probability can be formulated as the ratio of upper incomplete Gamma function and Gamma function, allowing an arbitrary number of elements. Finally, we comprehensively evaluate the impacts of essential factors on the coverage probability, such as the coefficient of fading channel, the number and size of the element, and the angle of incidence. Overall, the paper provides a succinct and general expression of coverage probability, which can be helpful in the performance evaluation and practical implementation of the IRS.

Submitted 5 December, 2020; originally announced December 2020.
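
The moment-matching step described in the abstract above can be sketched as a short worked formula. This is only an illustration under stated assumptions, not the paper's derivation: it assumes the received SNR $\gamma$ is approximated by a Gamma distribution whose shape $k$ and scale $\theta$ are fitted to the first two moments of $\gamma$. The symbols $\gamma$, $k$, $\theta$, and the threshold $\gamma_{\mathrm{th}}$ are hypothetical names introduced here.

```latex
% Assumed Gamma approximation of the SNR, fitted by moment matching
k = \frac{\left(\mathbb{E}[\gamma]\right)^{2}}{\operatorname{Var}[\gamma]},
\qquad
\theta = \frac{\operatorname{Var}[\gamma]}{\mathbb{E}[\gamma]},
\qquad
P_{\mathrm{cov}} = \Pr\left(\gamma > \gamma_{\mathrm{th}}\right)
\approx \frac{\Gamma\left(k,\ \gamma_{\mathrm{th}}/\theta\right)}{\Gamma(k)}
```

Here $\Gamma(k, x)$ is the upper incomplete Gamma function, so the right-hand side is the survival function of the fitted Gamma distribution evaluated at the threshold, which matches the "ratio of upper incomplete Gamma function and Gamma function" form the abstract mentions.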

arXiv:2012.00267 [pdf, ps, other]

Performance and Optimization of Reconfigurable Intelligent Surface Aided THz Communications

Authors: Hongyang Du, Jiayi Zhang, Ke Guan, Dusit Niyato, Huiying Jiao, Zhiqin Wang, Thomas Kürner

Abstract: TeraHertz (THz) communications can satisfy the high data rate demand with massive bandwidth. However, severe path attenuation and hardware imperfection greatly degrade its performance. Therefore, we utilize the reconfigurable intelligent surface (RIS) technology and investigate the RIS-aided THz communications. We first prove that the small-scale amplitude fading of THz signals can be accurately modeled by the fluctuating two-ray distribution based on two THz signal measurement experiments conducted in a variety of different scenarios. To optimize the phase-shifts at the RIS elements, we propose a novel swarm intelligence-based method that does not require full channel estimation. We then derive exact statistical characterizations of end-to-end signal-to-noise plus distortion ratio (SNDR) and signal-to-noise ratio (SNR). Moreover, we present asymptotic analysis to obtain more insights when the SNDR or the number of RIS's elements is high. Finally, we derive analytical expressions for the outage probability and ergodic capacity. The tight upper bounds of ergodic capacity for both ideal and nonideal radio frequency chains are obtained. It is interesting to find that increasing the number of RIS's elements can significantly improve the THz communications system performance. For example, the ergodic capacity can increase up to 25% when the number of elements increases from 40 to 80, which incurs only insignificant costs to the system.

Submitted 20 March, 2022; v1 submitted 1 December, 2020; originally announced December 2020.

arXiv:2011.02327 [pdf, other]

InferBench: Understanding Deep Learning Inference Serving with an Automatic Benchmarking System

Authors: Huaizheng Zhang, Yizheng Huang, Yonggang Wen, Jianxiong Yin, Kyle Guan

Abstract: Deep learning (DL) models have become core modules for many applications. However, deploying these models without careful performance benchmarking that considers both hardware and software's impact often leads to poor service and costly operational expenditure. To facilitate DL models' deployment, we implement an automatic and comprehensive benchmark system for DL developers. To accomplish benchmark-related tasks, the developers only need to prepare a configuration file consisting of a few lines of code. Our system, deployed to a leader server in DL clusters, will dispatch users' benchmark jobs to follower workers. Next, the corresponding requests, workload, and even models can be generated automatically by the system to conduct DL serving benchmarks. Finally, developers can leverage many analysis tools and models in our system to gain insights into the trade-offs of different system configurations. In addition, a two-tier scheduler is incorporated to avoid unnecessary interference and improve average job compilation time by up to 1.43x (equivalent of 30% reduction). Our system design follows the best practice in DL clusters operations to expedite day-to-day DL service evaluation efforts by the developers. We conduct many benchmark experiments to provide in-depth and comprehensive evaluations. We believe these results are of great value as guidelines for DL service configuration and resource allocation.

Submitted 5 January, 2021; v1 submitted 4 November, 2020; originally announced November 2020.

Comments: 13 pages, 15 figures
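
The abstract above equates a 1.43x improvement with a 30% reduction. A minimal sketch of that arithmetic follows, assuming the usual convention that an s-fold speedup means the new time is the old time divided by s; the function name `speedup_to_reduction` is hypothetical and not part of the InferBench system.

```python
def speedup_to_reduction(speedup: float) -> float:
    """Return the fractional time reduction implied by a speedup factor.

    Assumes new_time = old_time / speedup, so the reduction is 1 - 1/speedup.
    """
    if speedup <= 0:
        raise ValueError("speedup must be positive")
    return 1.0 - 1.0 / speedup


if __name__ == "__main__":
    # 1.43x from the abstract; the result is roughly 0.30, i.e. about 30%.
    print(f"{speedup_to_reduction(1.43):.1%}")
```

Under this convention the two figures in the abstract are consistent with each other.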

arXiv:2008.07220 [pdf, other]

Scoring the Terabit/s Goal: Broadband Connectivity in 6G

Authors: Nandana Rajatheva, Italo Atzeni, Simon Bicais, Emil Bjornson, Andre Bourdoux, Stefano Buzzi, Carmen D'Andrea, Jean-Baptiste Dore, Serhat Erkucuk, Manuel Fuentes, Ke Guan, Yuzhou Hu, Xiaojing Huang, Jari Hulkkonen, Josep Miquel Jornet, Marcos Katz, Behrooz Makki, Rickard Nilsson, Erdal Panayirci, Khaled Rabie, Nuwanthika Rajapaksha, MohammadJavad Salehi, Hadi Sarieddeen, Shahriar Shahabuddin, Tommy Svensson , et al. (4 additional authors not shown)

Abstract: This paper explores the road to vastly improving the broadband connectivity in future 6G wireless systems. Different categories of use cases are considered, with peak data rates up to 1 Tbps. Several categories of enablers at the infrastructure, spectrum, and protocol/algorithmic levels are required to realize the intended broadband connectivity goals in 6G. At the infrastructure level, we consider ultra-massive MIMO technology (possibly implemented using holographic radio), intelligent reflecting surfaces, user-centric cell-free networking, integrated access and backhaul, and integrated space and terrestrial networks. At the spectrum level, the network must seamlessly utilize sub-6 GHz bands for coverage and spatial multiplexing of many devices, while higher bands will be mainly used for pushing the peak rates of point-to-point links. Finally, at the protocol/algorithmic level, the enablers include improved coding, modulation, and waveforms to achieve lower latency, higher reliability, and reduced complexity.

Submitted 21 February, 2021; v1 submitted 17 August, 2020; originally announced August 2020.

Comments: Submitted to IEEE Access. 51 pages, 31 figures. arXiv admin note: text overlap with arXiv:2004.14247

arXiv:2006.06168 [pdf, other]

Satellite-Terrestrial Channel Characterization in High-Speed Railway Environment at 22.6 GHz

Authors: Lei Ma, Ke Guan, Dong Yan, Danping He, Nuno R. Leonor, Bo Ai, Junhyeong Kim

Abstract: The integration of satellite and terrestrial communication systems plays a vital role in the fifth-generation mobile communication system (5G) for the ubiquitous coverage, reliable service and flexible networking. Moreover, the millimeter wave (mmWave) communication with large bandwidth is a key enabler for 5G intelligent rail transportation. In this paper, the satellite-terrestrial channel at 22.6 GHz is characterized for a typical high-speed railway (HSR) environment. The three-dimensional model of the railway scenario is reconstructed and imported into the Cloud Ray-Tracing (CloudRT) simulation platform. Based on extensive ray-tracing simulations, the channels for the terrestrial HSR system and the satellite-terrestrial system with two weather conditions are characterized, and the interference between them is evaluated. The results of this paper can help in the design and evaluation of the satellite-terrestrial communication system enabling future intelligent rail transportation.

Submitted 10 June, 2020; originally announced June 2020.

arXiv:2006.05096 [pdf, other]

doi 10.1145/3394171.3414535

MLModelCI: An Automatic Cloud Platform for Efficient MLaaS

Authors: Huaizheng Zhang, Yuanming Li, Yizheng Huang, Yonggang Wen, Jianxiong Yin, Kyle Guan

Abstract: MLModelCI provides multimedia researchers and developers with a one-stop platform for efficient machine learning (ML) services. The system leverages DevOps techniques to optimize, test, and manage models. It also containerizes and deploys these optimized and validated models as cloud services (MLaaS). In its essence, MLModelCI serves as a housekeeper to help users publish models. The models are first automatically converted to optimized formats for production purposes and then profiled under different settings (e.g., batch size and hardware). The profiling information can be used as guidelines for balancing the trade-off between performance and cost of MLaaS. Finally, the system dockerizes the models for ease of deployment to cloud environments. A key feature of MLModelCI is the implementation of a controller, which allows elastic evaluation that only utilizes idle workers while maintaining online service quality. Our system bridges the gap between current ML training and serving systems and thus frees developers from manual and tedious work often associated with service deployment. We release the platform as an open-source project on GitHub under Apache 2.0 license, with the aim that it will facilitate and streamline more large-scale ML applications and research projects.

Submitted 9 June, 2020; originally announced June 2020.

Comments: 4 pages, 4 figures

Journal ref: In Proceedings of the 28th ACM International Conference on Multimedia (2020) 4453-4456

arXiv:2004.14247 [pdf, other]

White Paper on Broadband Connectivity in 6G

Authors: Nandana Rajatheva, Italo Atzeni, Emil Bjornson, Andre Bourdoux, Stefano Buzzi, Jean-Baptiste Dore, Serhat Erkucuk, Manuel Fuentes, Ke Guan, Yuzhou Hu, Xiaojing Huang, Jari Hulkkonen, Josep Miquel Jornet, Marcos Katz, Rickard Nilsson, Erdal Panayirci, Khaled Rabie, Nuwanthika Rajapaksha, MohammadJavad Salehi, Hadi Sarieddeen, Tommy Svensson, Oskari Tervo, Antti Tolli, Qingqing Wu, Wen Xu

Abstract: This white paper explores the road to implementing broadband connectivity in future 6G wireless systems. Different categories of use cases are considered, from extreme capacity with peak data rates up to 1 Tbps, to raising the typical data rates by orders-of-magnitude, to support broadband connectivity at railway speeds up to 1000 km/h. To achieve these goals, not only the terrestrial networks will be evolved but they will also be integrated with satellite networks, all facilitating autonomous systems and various interconnected structures. We believe that several categories of enablers at the infrastructure, spectrum, and protocol/algorithmic levels are required to realize the intended broadband connectivity goals in 6G. At the infrastructure level, we consider ultra-massive MIMO technology (possibly implemented using holographic radio), intelligent reflecting surfaces, user-centric and scalable cell-free networking, integrated access and backhaul, and integrated space and terrestrial networks. At the spectrum level, the network must seamlessly utilize sub-6 GHz bands for coverage and spatial multiplexing of many devices, while higher bands will be used for pushing the peak rates of point-to-point links. The latter path will lead to THz communications complemented by visible light communications in specific scenarios. At the protocol/algorithmic level, the enablers include improved coding, modulation, and waveforms to achieve lower latencies, higher reliability, and reduced complexity. Different options will be needed to optimally support different use cases. The resource efficiency can be further improved by using various combinations of full-duplex radios, interference management based on rate-splitting, machine-learning-based optimization, coded caching, and broadcasting.

Submitted 29 April, 2020; originally announced April 2020.

Comments: 46 pages, 13 figures

arXiv:1911.03607 [pdf, other]

DeepMask: an algorithm for cloud and cloud shadow detection in optical satellite remote sensing images using deep residual network

Authors: Ke Xu, Kaiyu Guan, Jian Peng, Yunan Luo, Sibo Wang

Abstract: Detecting and masking cloud and cloud shadow from satellite remote sensing images is a pervasive problem in the remote sensing community. Accurate and efficient detection of cloud and cloud shadow is an essential step to harness the value of remotely sensed data for almost all downstream analysis. DeepMask, a new algorithm for cloud and cloud shadow detection in optical satellite remote sensing imagery, is proposed in this study. DeepMask utilizes ResNet, a deep convolutional neural network, for pixel-level cloud mask generation. The algorithm is trained and evaluated on the Landsat 8 Cloud Cover Assessment Validation Dataset distributed across 8 different land types. Compared with CFMask, the most widely used cloud detection algorithm, land-type-specific DeepMask models achieve higher accuracy across all land types. The average accuracy is 93.56%, compared with 85.36% from CFMask. DeepMask also achieves 91.02% accuracy on all-land-type dataset. Compared with other CNN-based cloud mask algorithms, DeepMask benefits from the parsimonious architecture and the residual connection of ResNet. It is compatible with input of any size and shape. DeepMask still maintains high performance when using only red, green, blue, and NIR bands, indicating its potential to be applied to other satellite platforms that only have limited optical bands.

Submitted 8 November, 2019; originally announced November 2019.

Comments: 17 pages, 4 figures, 6 tables

arXiv:1907.01518 [pdf, other]

Analytical Modeling of UAV-to-Vehicle Propagation Channels in Built-Up Areas

Authors: Zhuangzhuang Cui, Ke Guan, César Briso, Danping He, Jianqiao Cheng, Zhangdui Zhong, François Quitin

Abstract: This letter presents an analytical path loss model for air-ground (AG) propagation between unmanned aerial vehicles (UAVs) and ground-based vehicles. We consider built-up areas, such as the ones defined by ITU-R. The three-dimensional (3D) path loss model is based on propagation conditions and essential parameters are derived by using geometric methods. Owing to the generality, the analytical model is capable of arbitrary deployments of buildings, such as suburban, urban and dense urban. The analytical model is evaluated numerically, and validations conducted by ray-tracing simulations show the high accuracy of the proposed model. The closed-form analytical formulas provide a useful tool for quick and accurate prediction of UAV-to-vehicle propagation channels.

Submitted 26 June, 2019; originally announced July 2019.

Comments: Submitted to IEEE Wireless Communications Letters

arXiv:1906.10909 [pdf, other]

Probabilistic Two-Ray Model for Air-to-Air Channel in Built-Up Areas

Authors: Zhuangzhuang Cui, Ke Guan, César Briso, Danping He, Bo Ai, Zhangdui Zhong

Abstract: In this paper, we present a probabilistic two-ray (PTR) path loss model for air-to-air (AA) propagation channel in built-up areas. Based on the statistical model of city deployment, the PTR path loss model can be applied to suburban, urban, dense urban, and high-rise urban. The path loss is optimally fitted as the Weibull distribution and its fluctuation is fitted as the Normal distribution in ray-tracing simulations. The good agreements between our model and ray tracing indicate the proposed model can provide a useful tool for accurate and quick prediction for aerial platforms. As an extended research of PTR model, we extract the shadowing factor by numerous simulations and propose the altitude-dependent shadowing model. The result shows that the proposed shadowing model has very good consistency with the measurement-based model, which indicates that our research performs well in the extensibility and generality.

Submitted 26 June, 2019; originally announced June 2019.

arXiv:1905.13719 [pdf, other]

Reinforcement Learning Experience Reuse with Policy Residual Representation

Authors: Wen-Ji Zhou, Yang Yu, Yingfeng Chen, Kai Guan, Tangjie Lv, Changjie Fan, Zhi-Hua Zhou

Abstract: Experience reuse is key to sample-efficient reinforcement learning. One of the critical issues is how the experience is represented and stored. Previously, the experience can be stored in the forms of features, individual models, and the average model, each lying at a different granularity. However, new tasks may require experience across multiple granularities. In this paper, we propose the policy residual representation (PRR) network, which can extract and store multiple levels of experience. PRR network is trained on a set of tasks with a multi-level architecture, where a module in each level corresponds to a subset of the tasks. Therefore, the PRR network represents the experience in a spectrum-like way. When training on a new task, PRR can provide different levels of experience for accelerating the learning. We experiment with the PRR network on a set of grid world navigation tasks, locomotion tasks, and fighting tasks in a video game. The results show that the PRR network leads to better reuse of experience and thus outperforms some state-of-the-art approaches.

Submitted 31 May, 2019; originally announced May 2019.

Comments: Conference version appears in IJCAI 2019

arXiv:1811.02350 [pdf, ps, other]

Resource Allocation for Device-to-Device Communications Underlaying Heterogeneous Cellular Networks Using Coalitional Games

Authors: Yali Chen, Bo Ai, Yong Niu, Ke Guan, Zhu Han

Abstract: Heterogeneous cellular networks (HCNs) with millimeter wave (mmWave) communications included are emerging as a promising candidate for the fifth generation mobile network. With highly directional antenna arrays, mmWave links are able to provide several-Gbps transmission rate. However, mmWave links are easily blocked without line of sight. On the other hand, D2D communications have been proposed to support many content based applications, and need to share resources with users in HCNs to improve spectral reuse and enhance system capacity. Consequently, an efficient resource allocation scheme for D2D pairs among both mmWave and the cellular carrier band is needed. In this paper, we first formulate the problem of the resource allocation among mmWave and the cellular band for multiple D2D pairs from the view point of game theory. Then, with the characteristics of cellular and mmWave communications considered, we propose a coalition formation game to maximize the system sum rate in statistical average sense. We also theoretically prove that our proposed game converges to a Nash-stable equilibrium and further reaches the near-optimal solution with fast convergence rate. Through extensive simulations under various system parameters, we demonstrate the superior performance of our scheme in terms of the system sum rate compared with several other practical schemes.

Submitted 6 November, 2018; originally announced November 2018.

Comments: 13 pages, 12 figures

Journal ref: IEEE Transactions on Wireless Communications, Year: 2018, Volume: 17, Issue: 6, Pages: 4163-4176

arXiv:1805.09496 [pdf, other]

Intelligent Trainer for Model-Based Reinforcement Learning

Authors: Yuanlong Li, Linsen Dong, Xin Zhou, Yonggang Wen, Kyle Guan

Abstract: Model-based reinforcement learning (MBRL) has been proposed as a promising alternative solution to tackle the high sampling cost challenge in the canonical reinforcement learning (RL), by leveraging a learned model to generate synthesized data for policy training purpose. The MBRL framework, nevertheless, is inherently limited by the convoluted process of jointly learning control policy and configuring hyper-parameters (e.g., global/local models, real and synthesized data, etc). The training process could be tedious and prohibitively costly. In this research, we propose a "reinforcement on reinforcement" (RoR) architecture to decompose the convoluted tasks into two layers of reinforcement learning. The inner layer is the canonical model-based RL training process environment (TPE), which learns the control policy for the underlying system and exposes interfaces to access states, actions and rewards. The outer layer presents an RL agent, called the AI trainer, to learn an optimal hyper-parameter configuration for the inner TPE. This decomposition approach provides a desirable flexibility to implement different trainer designs, called "train the trainer". In our research, we propose and optimize two alternative trainer designs: 1) a uni-head trainer and 2) a multi-head trainer. Our proposed RoR framework is evaluated for five tasks in the OpenAI gym (i.e., Pendulum, Mountain Car, Reacher, Half Cheetah and Swimmer). Compared to three other baseline algorithms, our proposed Train-the-Trainer algorithm has a competitive performance in auto-tuning capability, with up to 56% expected sampling cost saving without knowing the best parameter setting in advance. The proposed trainer framework can be easily extended to other cases in which the hyper-parameter tuning is costly.

Submitted 5 June, 2019; v1 submitted 23 May, 2018; originally announced May 2018.

Comments: 13 pages

arXiv:1804.03878 [pdf, other]

doi 10.1088/1751-8121/aacb44

The asymmetric quantum Rabi model and generalised Pöschl-Teller potentials

Authors: Kai-Long Guan, Zi-Min Li, Clare Dunning, Murray T Batchelor

Abstract: Starting with the Gaudin-like Bethe ansatz equations associated with the quasi-exactly solved (QES) exceptional points of the asymmetric quantum Rabi model (AQRM) a spectral equivalence is established with QES hyperbolic Schrödinger potentials on the line. This leads to particular QES Pöschl-Teller potentials. The complete spectral equivalence is then established between the AQRM and generalised Pöschl-Teller potentials. This result extends a previous mapping between the symmetric quantum Rabi model and a QES Pöschl-Teller potential. The complete spectral equivalence between the two systems suggests that the physics of the generalised Pöschl-Teller potentials may also be explored in experimental realisations of the quantum Rabi model.

Submitted 6 June, 2018; v1 submitted 11 April, 2018; originally announced April 2018.

Comments: 17 pages, 4 figures, minor changes, additional references

Journal ref: J. Phys. A: Math. Theor. 51 315204 (2018)

arXiv:1804.03481 [pdf, ps, other]

DeepQoE: A unified Framework for Learning to Predict Video QoE

Authors: Huaizheng Zhang, Han Hu, Guanyu Gao, Yonggang Wen, Kyle Guan

Abstract: Motivated by the prowess of deep learning (DL) based techniques in prediction, generalization, and representation learning, we develop a novel framework called DeepQoE to predict video quality of experience (QoE). The end-to-end framework first uses a combination of DL techniques (e.g., word embeddings) to extract generalized features. Next, these features are combined and fed into a neural network for representation learning. Such representations serve as inputs for classification or regression tasks. Evaluating the performance of DeepQoE with two datasets, we show that for the small dataset, the accuracy of all shallow learning algorithms is improved by using the representation derived from DeepQoE. For the large dataset, our DeepQoE framework achieves significant performance improvement in comparison to the best baseline method (90.94% vs. 82.84%). Moreover, DeepQoE, also released as an open source tool, provides video QoE research much-needed flexibility in fitting different datasets, extracting generalized features, and learning representations.

Submitted 10 April, 2018; originally announced April 2018.

Comments: 6 pages, 5 figures, ICME2018

arXiv:1709.05077 [pdf, other]

Transforming Cooling Optimization for Green Data Center via Deep Reinforcement Learning

Authors: Yuanlong Li, Yonggang Wen, Kyle Guan, Dacheng Tao

Abstract: Cooling system plays a critical role in a modern data center (DC). Developing an optimal control policy for DC cooling system is a challenging task. The prevailing approaches often rely on approximating system models that are built upon the knowledge of mechanical cooling, electrical and thermal management, which is difficult to design and may lead to sub-optimal or unstable performances. In this paper, we propose utilizing the large amount of monitoring data in DC to optimize the control policy. To do so, we cast the cooling control policy design into an energy cost minimization problem with temperature constraints, and tap it into the emerging deep reinforcement learning (DRL) framework. Specifically, we propose an end-to-end cooling control algorithm (CCA) that is based on the actor-critic framework and an off-policy offline version of the deep deterministic policy gradient (DDPG) algorithm. In the proposed CCA, an evaluation network is trained to predict an energy cost counter penalized by the cooling status of the DC room, and a policy network is trained to predict optimized control settings when given the current load and weather information. The proposed algorithm is evaluated on the EnergyPlus simulation platform and on a real data trace collected from the National Super Computing Centre (NSCC) of Singapore. Our results show that the proposed CCA can achieve about 11% cooling cost saving on the simulation platform compared with a manually configured baseline control algorithm. In the trace-based study, we propose a de-underestimation (DUE) validation mechanism as we cannot directly test the algorithm on a real DC. Even though with DUE the results are conservative, we can still achieve about 15% cooling energy saving on the NSCC data trace if we set the inlet temperature threshold at 26.6 degrees Celsius.

Submitted 18 July, 2018; v1 submitted 15 September, 2017; originally announced September 2017.

arXiv:1405.6825 [pdf, ps, other]

doi 10.1088/0004-637X/789/1/73

A Fan Beam Model for Radio Pulsars. I. Observational Evidence

Authors: Hong Guang Wang, Fei Peng Pi, Xiao Ping Zheng, Chun Lan Deng, Sai Qin Wen, Feng Ye, Kai Ying Guan, Yi Liu, Li Qing Xu

Abstract: We propose a novel beam model for radio pulsars based on the scenario that the broadband and coherent emission from secondary relativistic particles, as they move along a flux tube in a dipolar magnetic field, forms a radially extended sub-beam with unique properties. The whole radio beam may consist of several sub-beams, forming a fan-shaped pattern. When only one or a few flux tubes are active, the fan beam becomes very patchy. This model differs essentially from the conal beam models in the respects of beam structure and predictions on the relationship between pulse width and impact angle $β$ (the angle between line of sight and magnetic pole) and the relationship between emission intensity and beam angular radius. The evidence for this model comes from the observed patchy beams of precessional binary pulsars and three statistical relationships found for a sample of 64 pulsars, of which $β$ were mostly constrained by fitting polarization position angle data with the Rotation Vector Model. With appropriate assumptions, the fan beam model can reproduce the relationship between 10\% peak pulse width and $|β|$, the anticorrelation between the emission intensity and $|β|$, and the upper boundary line in the scatter plot of $|β|$ versus pulsar distance. An extremely patchy beam model with the assumption of narrowband emission from one or a few flux tubes is studied and found unlikely to be a general model. The implications of the fan beam model to the studies on radio and gamma-ray pulsar populations and radio polarization are discussed.

Submitted 27 May, 2014; originally announced May 2014.

Comments: 55 pages, 23 figures, 1 table. To be published in ApJ, 2014, Vol.789

Journal ref: 2014, ApJ, 789, 73

arXiv:1403.1511 [pdf]

Generalized Floquet Exponent, Attractiveness Portrait and Structure Hidden in an Attractor

Authors: Keying Guan

Abstract: The generalized Floquet exponent and the attractiveness portrait (or A-portrait for short) of the attractor and of the smallest invariant closed set are suggested to be used for the study of dynamical systems. Based on the A-portrait, some simple structures hidden in a complicated attractor may emerge from an attractor with complicated structure. The hidden structure plays an important role in the bifurcation phenomena of the invariant sets. The examples of A-portraits for the Van der Pol limit cycle, for Lorenz attractor, for the closed limit orbits of different rotation numbers and complicated attractors of Silnikov equation, and for three interlocked smallest invariant closed sets of the new improved Nosé-Hoover oscillator are given.

Submitted 6 March, 2014; originally announced March 2014.

Comments: 42 pages, 40 figures

MSC Class: 37c10; 37c70

Journal ref: Wolfgang Walter, Ordinary Differential Equations, Springer-Verlag New York, Inc. 1988, ISBN 0-387-98459-3

arXiv:1401.3315 [pdf]

Important Notes on Lyapunov Exponents

Authors: Keying Guan

Abstract: It is shown that the famous Lyapunov exponents cannot be used as the numerical characteristic for distinguishing different kinds of attractors, such as the equilibrium point, the limit closed curve, the stable torus and the strange attractor.

Submitted 14 January, 2014; originally announced January 2014.

Comments: 18 pages, 11 figures

MSC Class: 37c10; 37c70

Journal ref: Cencini M. et al., M. Chaos From Simple models to complex systems. World Scientific, (2010). ISBN 981-4277-65-7

arXiv:1312.2043 [pdf]

Period-doubling cascades of a Silnikov equation

Authors: Keying Guan, Beiye Feng

Abstract: Based on numerical results of a Silnikov equation, three period-doubling cascades, corresponding respectively to three different characters of the rotation number of a limit closed orbit, are studied, and the Feigenbaum constant is used successfully in the estimation of the critical parameter values for the period-doubling bifurcation. The conceptions of separaror and pseudo-attractor are also introduced in the discussion part of this paper.

Submitted 15 December, 2013; v1 submitted 6 December, 2013; originally announced December 2013.

Comments: 20 pages, 41 figures

MSC Class: 37c10; 37c70

Journal ref: John Guckenheimer and Philip Holmes, Nonlinear Oscillations, Dynamical Systems, and Bifurcations of Vector Fields, Springer-Verlag, New York, 1983

arXiv:1311.6202 [pdf]

Non-trivial Local Attractors of a Three-dimensional Dynamical System

Authors: Keying Guan

Abstract: Based on both qualitative method and numerical tests for a series of particular cases in the parameter region, a=1, 0<b<1, it is shown that the three-dimensional system (2) may have a series of interesting phenomena on the non-trivial local attractors, such as the faint attractor (this term is suggested by the author), the local attractor with complex structure, twin spatial limit closed orbits, the bifurcation of rotation numbers, and the spatial limit cycle, etc. The system (2) is a very rich source in the study of dynamical system theory.

Submitted 24 December, 2013; v1 submitted 24 November, 2013; originally announced November 2013.

Comments: 20 pages, 48 figures

MSC Class: 37c10; 37c70

Journal ref: John Guckenheimer and Philip Holmes, Nonlinear Oscillations,..., Springer-Verlag, New York, 1983 J.C. Sprott, Chaos and Time-series Analysis, Oxford University Press, 2003

arXiv:1304.4181 [pdf, ps, other]

doi 10.1109/TCOMM.2014.010914.130256

Rate-Distortion-Based Physical Layer Secrecy with Applications to Multimode Fiber

Authors: Eva C. Song, Emina Soljanin, Paul Cuff, H. Vincent Poor, Kyle Guan

Abstract: Optical networks are vulnerable to physical layer attacks; wiretappers can improperly receive messages intended for legitimate recipients. Our work considers an aspect of this security problem within the domain of multimode fiber (MMF) transmission. MMF transmission can be modeled via a broadcast channel in which both the legitimate receiver's and wiretapper's channels are multiple-input-multiple-output complex Gaussian channels. Source-channel coding analyses based on the use of distortion as the metric for secrecy are developed. Alice has a source sequence to be encoded and transmitted over this broadcast channel so that the legitimate user Bob can reliably decode while forcing the distortion of wiretapper, or eavesdropper, Eve's estimate as high as possible. Tradeoffs between transmission rate and distortion under two extreme scenarios are examined: the best case where Eve has only her channel output and the worst case where she also knows the past realization of the source. It is shown that under the best case, an operationally separate source-channel coding scheme guarantees maximum distortion at the same rate as needed for reliable transmission. Theoretical bounds are given, and particularized for MMF. Numerical results showing the rate distortion tradeoff are presented and compared with corresponding results for the perfect secrecy case.

Submitted 29 December, 2013; v1 submitted 15 April, 2013; originally announced April 2013.

Comments: 30 pages, 5 figures, accepted to IEEE Transactions on Communications

Journal ref: IEEE Trans. on Communications, 62(3):1080-90, March, 2014

Showing 1–47 of 47 results for author: Guan, K