Next Article in Journal
Analysis of Bubble Flow in an Inclined Tube and Modeling of Flow Prediction
Previous Article in Journal
Time Optimal Altitude-Hold Flight Mode Transition Strategy for a Class of Ducted Fan Tail Sitter UAV
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

An Intelligent Bait Delivery Control Method for Flight Vehicle Evasion Based on Reinforcement Learning

by
Shuai Xue
1,
Zhaolei Wang
2,
Hongyang Bai
1,*,
Chunmei Yu
2,
Tianyu Deng
1 and
Ruisheng Sun
1
1
School of Energy and Power Engineering, Nanjing University of Science and Technology, Nanjing 210094, China
2
National Key Laboratory of Science and Technology on Aerospace Intelligence Control, Beijing Aerospace Automatic Control Institute, Beijing 100854, China
*
Author to whom correspondence should be addressed.
Aerospace 2024, 11(8), 653; https://doi.org/10.3390/aerospace11080653 (registering DOI)
Submission received: 13 June 2024 / Revised: 5 August 2024 / Accepted: 7 August 2024 / Published: 11 August 2024
(This article belongs to the Section Aeronautics)

Abstract

During aerial combat, when an aircraft is facing an infrared air-to-air missile strike, infrared baiting technology is an important means of penetration, and the strategy of effective delivery of infrared bait is critical. To address this issue, this study proposes an improved deep deterministic policy gradient (DDPG) algorithm-based intelligent bait-dropping control method. Firstly, by modeling the relative motion between aircraft, bait, and incoming missiles, the Markov decision process of aircraft-bait-missile infrared effect was constructed with visual distance and line of sight angle as states. Then, the DDPG algorithm was improved by means of pre-training and classification sampling. Significantly, the infrared bait-dropping decision network was trained through interaction with the environment and iterative learning, which led to the development of the bait-dropping strategy. Finally, the corresponding environment was transferred to the Nvidia Jetson TX2 embedded platform for comparative testing. The simulation results showed that the convergence speed of this method was 46.3% faster than the traditional DDPG algorithm. More importantly, it was able to generate an effective bait-throwing strategy, enabling the aircraft to successfully evade the attack of the incoming missile. The strategy instruction generation time is only about 2.5 ms, giving it the ability to make online decisions.
Keywords: infrared bait; deep deterministic policy gradient; infrared interference; dropping strategy; online decision infrared bait; deep deterministic policy gradient; infrared interference; dropping strategy; online decision

Share and Cite

MDPI and ACS Style

Xue, S.; Wang, Z.; Bai, H.; Yu, C.; Deng, T.; Sun, R. An Intelligent Bait Delivery Control Method for Flight Vehicle Evasion Based on Reinforcement Learning. Aerospace 2024, 11, 653. https://doi.org/10.3390/aerospace11080653

AMA Style

Xue S, Wang Z, Bai H, Yu C, Deng T, Sun R. An Intelligent Bait Delivery Control Method for Flight Vehicle Evasion Based on Reinforcement Learning. Aerospace. 2024; 11(8):653. https://doi.org/10.3390/aerospace11080653

Chicago/Turabian Style

Xue, Shuai, Zhaolei Wang, Hongyang Bai, Chunmei Yu, Tianyu Deng, and Ruisheng Sun. 2024. "An Intelligent Bait Delivery Control Method for Flight Vehicle Evasion Based on Reinforcement Learning" Aerospace 11, no. 8: 653. https://doi.org/10.3390/aerospace11080653

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.
Back to TopTop