Next Article in Journal
Enhancing Syslog Message Security and Reliability over Unidirectional Fiber Optics
Previous Article in Journal
Design of Docking Interfaces for On-Orbit Assembly of Large Structures in Space
Previous Article in Special Issue
Jointly Optimization of Delay and Energy Consumption for Multi-Device FDMA in WPT-MEC System
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

Multi-Agent DRL for Air-to-Ground Communication Planning in UAV-Enabled IoT Networks

Key Laboratory for Ubiquitous Network and Service Software of Liaoning Province, School of Software, Dalian University of Technology, Dalian 116024, China
*
Authors to whom correspondence should be addressed.
Sensors 2024, 24(20), 6535; https://doi.org/10.3390/s24206535
Submission received: 24 September 2024 / Revised: 8 October 2024 / Accepted: 9 October 2024 / Published: 10 October 2024

Abstract

In this paper, we present a novel method to enhance the sum-rate effectiveness in full-duplex unmanned aerial vehicle (UAV)-assisted communication networks. Existing approaches often couple uplink and downlink associations, resulting in suboptimal performance, particularly in dynamic environments where user demands and network conditions are unpredictable. To overcome these limitations, we propose a decoupling of uplink and downlink associations for ground-based users (GBUs), significantly improving network efficiency. We formulate a comprehensive optimization problem that integrates UAV trajectory design and user association, aiming to maximize the overall sum-rate efficiency of the network. Due to the problem’s non-convexity, we reformulate it as a Partially Observable Markov Decision Process (POMDP), enabling UAVs to make real-time decisions based on local observations without requiring complete global information. Our framework employs multi-agent deep reinforcement learning (MADRL), specifically the Multi-Agent Deep Deterministic Policy Gradient (MADDPG) algorithm, which balances centralized training with distributed execution. This allows UAVs to efficiently learn optimal user associations and trajectory controls while dynamically adapting to local conditions. The proposed solution is particularly suited for critical applications such as disaster response and search and rescue missions, highlighting the practical significance of utilizing UAVs for rapid network deployment in emergencies. By addressing the limitations of existing centralized and distributed solutions, our hybrid model combines the benefits of centralized training with the adaptability of distributed inference, ensuring optimal UAV operations in real-time scenarios.
Keywords: IoUAVs; MADRL; SDN IoUAVs; MADRL; SDN

Share and Cite

MDPI and ACS Style

Qureshi, K.I.; Lu, B.; Lu, C.; Lodhi, M.A.; Wang, L. Multi-Agent DRL for Air-to-Ground Communication Planning in UAV-Enabled IoT Networks. Sensors 2024, 24, 6535. https://doi.org/10.3390/s24206535

AMA Style

Qureshi KI, Lu B, Lu C, Lodhi MA, Wang L. Multi-Agent DRL for Air-to-Ground Communication Planning in UAV-Enabled IoT Networks. Sensors. 2024; 24(20):6535. https://doi.org/10.3390/s24206535

Chicago/Turabian Style

Qureshi, Khalid Ibrahim, Bingxian Lu, Cheng Lu, Muhammad Ali Lodhi, and Lei Wang. 2024. "Multi-Agent DRL for Air-to-Ground Communication Planning in UAV-Enabled IoT Networks" Sensors 24, no. 20: 6535. https://doi.org/10.3390/s24206535

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop