Next Article in Journal
Effect of Indentation Depth on Friction Coefficient in Adhesive Contacts: Experiment and Simulation
Next Article in Special Issue
Intermittent Stop-Move Motion Planning for Dual-Arm Tomato Harvesting Robot in Greenhouse Based on Deep Reinforcement Learning
Previous Article in Journal
Diatomite-Based Recyclable and Green Coating for Efficient Radiative Cooling
Previous Article in Special Issue
Underactuated Humanoid Peeling Approach for Pickled Mustard Tuber Based on Metamorphic Constraints
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Autonomous Driving of Mobile Robots in Dynamic Environments Based on Deep Deterministic Policy Gradient: Reward Shaping and Hindsight Experience Replay

1
Department of Electronic Engineering, Yeungnam University, Gyeongsan 38541, Republic of Korea
2
School of Electronics Engineering, Kyungpook National University, Daegu 41566, Republic of Korea
*
Author to whom correspondence should be addressed.
Biomimetics 2024, 9(1), 51; https://doi.org/10.3390/biomimetics9010051
Submission received: 1 November 2023 / Revised: 10 January 2024 / Accepted: 11 January 2024 / Published: 13 January 2024
(This article belongs to the Special Issue Artificial Intelligence for Autonomous Robots 2024)

Abstract

In this paper, we propose a reinforcement learning-based end-to-end learning method for the autonomous driving of a mobile robot in a dynamic environment with obstacles. Applying two additional techniques for reinforcement learning simultaneously helps the mobile robot in finding an optimal policy to reach the destination without collisions. First, the multifunctional reward-shaping technique guides the agent toward the goal by utilizing information about the destination and obstacles. Next, employing the hindsight experience replay technique to address the experience imbalance caused by the sparse reward problem assists the agent in finding the optimal policy. We validated the proposed technique in both simulation and real-world environments. To assess the effectiveness of the proposed method, we compared experiments for five different cases.
Keywords: deep deterministic policy gradient; multifunctional reward shaping; hindsight experience replay; mobile robot; autonomous driving deep deterministic policy gradient; multifunctional reward shaping; hindsight experience replay; mobile robot; autonomous driving

Share and Cite

MDPI and ACS Style

Park, M.; Park, C.; Kwon, N.K. Autonomous Driving of Mobile Robots in Dynamic Environments Based on Deep Deterministic Policy Gradient: Reward Shaping and Hindsight Experience Replay. Biomimetics 2024, 9, 51. https://doi.org/10.3390/biomimetics9010051

AMA Style

Park M, Park C, Kwon NK. Autonomous Driving of Mobile Robots in Dynamic Environments Based on Deep Deterministic Policy Gradient: Reward Shaping and Hindsight Experience Replay. Biomimetics. 2024; 9(1):51. https://doi.org/10.3390/biomimetics9010051

Chicago/Turabian Style

Park, Minjae, Chaneun Park, and Nam Kyu Kwon. 2024. "Autonomous Driving of Mobile Robots in Dynamic Environments Based on Deep Deterministic Policy Gradient: Reward Shaping and Hindsight Experience Replay" Biomimetics 9, no. 1: 51. https://doi.org/10.3390/biomimetics9010051

APA Style

Park, M., Park, C., & Kwon, N. K. (2024). Autonomous Driving of Mobile Robots in Dynamic Environments Based on Deep Deterministic Policy Gradient: Reward Shaping and Hindsight Experience Replay. Biomimetics, 9(1), 51. https://doi.org/10.3390/biomimetics9010051

Article Metrics

Back to TopTop