Deep reinforcement learning based trajectory design and resource allocation for task-aware multi-UAV enabled MEC networks

Z Li, C Xu, Z Zhang, R Wu�- Computer Communications, 2024 - Elsevier
Z Li, C Xu, Z Zhang, R Wu
Computer Communications, 2024Elsevier
Computing tasks in the air is an important form of mobile edge computing (MEC) to improve
the quality of service and enhance network coverage. In this paper, we investigate a multi-
UAV cooperative computing model and massive devices access scenario in a service area
without infrastructure. There are various types of ground devices with different tasks.
Moreover, we consider that the UAV executing tasks of devices need to cache the content
that task required. Therefore, we propose a multi-UAV enabled MEC network based on task�…
Abstract
Computing tasks in the air is an important form of mobile edge computing (MEC) to improve the quality of service and enhance network coverage. In this paper, we investigate a multi-UAV cooperative computing model and massive devices access scenario in a service area without infrastructure. There are various types of ground devices with different tasks. Moreover, we consider that the UAV executing tasks of devices need to cache the content that task required. Therefore, we propose a multi-UAV enabled MEC network based on task awareness where each UAV caches some programs to execute tasks offloaded from devices. To minimize completion time, a joint UAV trajectory design, access decision and resource allocation problem is formulated. To address this intractable mixed integer non-linear programming problem, a multi-agent trajectory design and resource allocation (MATR) algorithm is proposed, where the multi-agent deep deterministic policy gradient (MADDPG) is applied. Considering the complexity of high-dimensional continuous action space, we introduce the particle swarm optimization (PSO) algorithm to jointly optimize access decisions, and computation resource allocation to reduce action space. In addition, we discuss the impact of the size of UAV cache space and the location of ground devices on the completion time. Simulation results show that the MATR algorithm can significantly reduce the completion time compared to the baselines.
Elsevier
Showing the best result for this search. See all results