Skip Content
You are currently on the new version of our website. Access the old version .

2,052 Results Found

  • Article
  • Open Access
6 Citations
2,266 Views
18 Pages

Intelligent Robot in Unknown Environments: Walk Path Using Q-Learning and Deep Q-Learning

  • Mouna El Wafi,
  • My Abdelkader Youssefi,
  • Rachid Dakir and
  • Mohamed Bakir

Autonomous navigation is essential for mobile robots to efficiently operate in complex environments. This study investigates Q-learning and Deep Q-learning to improve navigation performance. The research examines their effectiveness in complex maze c...

  • Article
  • Open Access
36 Citations
4,415 Views
16 Pages

A Self-Adaptive Reinforcement-Exploration Q-Learning Algorithm

  • Lieping Zhang,
  • Liu Tang,
  • Shenglan Zhang,
  • Zhengzhong Wang,
  • Xianhao Shen and
  • Zuqiong Zhang

11 June 2021

Directing at various problems of the traditional Q-Learning algorithm, such as heavy repetition and disequilibrium of explorations, the reinforcement-exploration strategy was used to replace the decayed ε-greedy strategy in the traditional Q-Learning...

  • Article
  • Open Access
19 Citations
3,700 Views
14 Pages

11 March 2021

In this paper, we address the application of the flying Drone Base Stations (DBS) in order to improve the network performance. Given the high degrees of freedom of a DBS, it can change its position and adapt its trajectory according to the users move...

  • Article
  • Open Access
1 Citations
1,036 Views
18 Pages

Network security and intrusion detection and response (IDR) are necessary issues nowadays. Enhancing our cyber defense by discovering advanced machine learning models, such as reinforcement learning and Q-learning, is a crucial security measure. This...

  • Article
  • Open Access
1,444 Views
16 Pages

XSQ-Learning: Adaptive Similarity Thresholds for Accelerated and Stable Q-Learning

  • Ansel Y. Rodríguez González,
  • Roberto E. López Díaz,
  • Shender M. Ávila Sansores and
  • María G. Sánchez Cervantes

27 June 2025

Reinforcement Learning (RL) enables agents to learn optimal policies through environment interaction, with Q-learning being a fundamental algorithm for Markov Decision Processes (MDPs). However, Q-learning suffers from slow convergence due to its exh...

  • Article
  • Open Access
3 Citations
3,543 Views
13 Pages

22 March 2020

In this paper, the Q-learning method for quadratic optimal control problem of discrete-time linear systems is reconsidered. The theoretical results prove that the quadratic optimal controller cannot be solved directly due to the linear correlation of...

  • Article
  • Open Access
21 Citations
5,452 Views
18 Pages

Indoor Emergency Path Planning Based on the Q-Learning Optimization Algorithm

  • Shenghua Xu,
  • Yang Gu,
  • Xiaoyan Li,
  • Cai Chen,
  • Yingyi Hu,
  • Yu Sang and
  • Wenxing Jiang

The internal structure of buildings is becoming increasingly complex. Providing a scientific and reasonable evacuation route for trapped persons in a complex indoor environment is important for reducing casualties and property losses. In emergency an...

  • Article
  • Open Access
3 Citations
3,148 Views
22 Pages

Scaling Up Q-Learning via Exploiting State–Action Equivalence

  • Yunlian Lyu,
  • Aymeric Côme,
  • Yijie Zhang and
  • Mohammad Sadegh Talebi

29 March 2023

Recent success stories in reinforcement learning have demonstrated that leveraging structural properties of the underlying environment is key in devising viable methods capable of solving complex tasks. We study off-policy learning in discounted rein...

  • Article
  • Open Access
11 Citations
3,919 Views
13 Pages

Oil Production Optimization Using Q-Learning Approach

  • Mazyar Zahedi-Seresht,
  • Bahram Sadeghi Bigham,
  • Shahrzad Khosravi and
  • Hoda Nikpour

2 January 2024

This paper presents an approach for optimizing the oil recovery factor by determining initial oil production rates. The proposed method utilizes the Q-learning method and the reservoir simulator (Eclipse 100) to achieve the desired objective. The sys...

  • Article
  • Open Access
5 Citations
3,025 Views
20 Pages

30 August 2020

In order to maximize energy efficiency in heterogeneous networks (HetNets), a turbo Q-Learning (TQL) combined with multistage decision process and tabular Q-Learning is proposed to optimize the resource configuration. For the large dimensions of acti...

  • Article
  • Open Access
60 Citations
9,786 Views
13 Pages

Autonomous Driving in Roundabout Maneuvers Using Reinforcement Learning with Q-Learning

  • Laura García Cuenca,
  • Enrique Puertas,
  • Javier Fernandez Andrés and
  • Nourdine Aliane

13 December 2019

Navigating roundabouts is a complex driving scenario for both manual and autonomous vehicles. This paper proposes an approach based on the use of the Q-learning algorithm to train an autonomous vehicle agent to learn how to appropriately navigate rou...

  • Article
  • Open Access
793 Views
10 Pages

29 August 2025

Vehicle routing improvement has become a vital topic in modern transport digitalization projects. Presently, there are no fully adapted techniques to offer optimal solutions for finding the best routes that include all visiting locations, considering...

  • Article
  • Open Access
22 Citations
4,353 Views
23 Pages

14 December 2021

Unmanned aerial vehicle (UAV) clusters usually face problems such as complex environments, heterogeneous combat subjects, and realistic interference factors in the course of mission assignment. In order to reduce resource consumption and improve the...

  • Article
  • Open Access
4 Citations
2,523 Views
12 Pages

25 March 2024

This research paper presents the Buckley-James Q-learning (BJ-Q) algorithm, a cutting-edge method designed to optimize personalized treatment strategies, especially in the presence of right censoring. We critically assess the algorithm’s effect...

  • Article
  • Open Access
20 Citations
6,934 Views
50 Pages

21 August 2024

This paper presents a comprehensive study on the optimization of electric vehicle (EV) battery management using Q-learning, a powerful reinforcement learning technique. As the demand for electric vehicles continues to grow, there is an increasing nee...

  • Article
  • Open Access
2 Citations
1,444 Views
15 Pages

2 December 2024

Due to the severe threats posed by smart jammers, anti-jamming decision making has become an essential technology for wireless communications. Most of the existing anti-jamming decision-making approaches have adopted Q-Learning to improve accuracy. H...

  • Proceeding Paper
  • Open Access
543 Views
10 Pages

Adaptive Q-Learning in Noisy Environments: A Study on Sensor Noise Influence

  • Mouna El Wafi,
  • My Abdelkader Youssefi,
  • Rachid Dakir,
  • Mohamed Bakir and
  • Younes El Koudia

14 October 2025

Reinforcement learning, particularly Q-learning, has demonstrated significant potential in autonomous navigation applications. However, the environments of the real world introduce sensor noise, which can impact learning efficiency and decision-makin...

  • Article
  • Open Access
1,131 Views
22 Pages

Stomach ulcers, a common type of gastrointestinal (GI) disease, pose serious health risks if not diagnosed and treated at an early stage. Therefore, in this research, a method is proposed based on two deep learning models for classification and segme...

  • Article
  • Open Access
1 Citations
978 Views
22 Pages

21 July 2025

In the era of rapid industrial automation advancements, the complexity of intelligent manufacturing equipment has been steadily escalated. Stringent demands for high-efficiency and high-precision diagnosis are increasingly being unmet by conventional...

  • Article
  • Open Access
36 Citations
5,745 Views
17 Pages

26 December 2021

Modern adaptive radars can switch work modes to perform various missions and simultaneously use pulse parameter agility in each mode to improve survivability, which leads to a multiplicative increase in the decision-making complexity and declining pe...

  • Article
  • Open Access
12 Citations
3,353 Views
20 Pages

An accurate mathematical model is a basis for controlling and estimating the state of an Autonomous underwater vehicle (AUV) system, so how to improve its accuracy is a fundamental problem in the field of automatic control. However, AUV systems are c...

  • Article
  • Open Access
4 Citations
2,460 Views
17 Pages

rl4dtn: Q-Learning for Opportunistic Networks

  • Jorge Visca and
  • Javier Baliosian

23 November 2022

Opportunistic networks are highly stochastic networks supported by sporadic encounters between mobile devices. To route data efficiently, opportunistic-routing algorithms must capitalize on devices’ movement and data transmission patterns. This...

  • Article
  • Open Access
7 Citations
2,300 Views
18 Pages

An Improved Q-Learning Algorithm for Optimizing Sustainable Remanufacturing Systems

  • Shujin Qin,
  • Xiaofei Zhang,
  • Jiacun Wang,
  • Xiwang Guo,
  • Liang Qi,
  • Jinrui Cao and
  • Yizhi Liu

16 May 2024

In our modern society, there has been a noticeable increase in pollution due to the trend of post-use handling of items. This necessitates the adoption of recycling and remanufacturing processes, advocating for sustainable resource management. This p...

  • Article
  • Open Access
23 Citations
6,140 Views
13 Pages

30 September 2018

This paper proposes a hybrid Zeigler-Nichols (Z-N) reinforcement learning approach for online tuning of the parameters of the Proportional Integral Derivative (PID) for controlling the speed of a DC motor. The PID gains are set by the Z-N method, and...

  • Article
  • Open Access
16 Citations
3,813 Views
24 Pages

Coexistence Scheme for Uncoordinated LTE and WiFi Networks Using Experience Replay Based Q-Learning

  • Merkebu Girmay,
  • Vasilis Maglogiannis,
  • Dries Naudts,
  • Adnan Shahid and
  • Ingrid Moerman

21 October 2021

Nowadays, broadband applications that use the licensed spectrum of the cellular network are growing fast. For this reason, Long-Term Evolution-Unlicensed (LTE-U) technology is expected to offload its traffic to the unlicensed spectrum. However, LTE-U...

  • Article
  • Open Access
4 Citations
2,912 Views
13 Pages

Environmental Perception Q-Learning to Prolong the Lifetime of Poultry Farm Monitoring Networks

  • Zike Wu,
  • Pan Pan,
  • Jieqiang Liu,
  • Beibei Shi,
  • Ming Yan and
  • Hongguang Zhang

3 December 2021

The reduction of the effects of heat-stress phenomena on poultry health and energy conservation of poultry farm monitoring networks are highly related problems. To address these problems, we propose environmental perception Q-learning (EPQL) to prolo...

  • Article
  • Open Access
2,954 Views
14 Pages

Q-Learning with the Variable Box Method: A Case Study to Land a Solid Rocket

  • Alejandro Tevera-Ruiz,
  • Rodolfo Garcia-Rodriguez,
  • Vicente Parra-Vega and
  • Luis Enrique Ramos-Velasco

2 February 2023

Some critical tasks require refined actions near the target, for instance, steering a car in a crowded parking lot or landing a rocket. These tasks are critical because failure to comply with the constraints near the target may lead to a fatal (unrec...

  • Article
  • Open Access
13 Citations
3,077 Views
28 Pages

27 May 2023

With the accelerated development of smart cities, the concept of a “smart industrial park” in which unmanned ground vehicles (UGVs) have wide application has entered the industrial field of vision. When faced with multiple tasks and heter...

  • Article
  • Open Access
32 Citations
4,211 Views
14 Pages

11 May 2023

Complete coverage path planning requires that the mobile robot traverse all reachable positions in the environmental map. Aiming at the problems of local optimal path and high path coverage ratio in the complete coverage path planning of the traditio...

  • Article
  • Open Access
2 Citations
1,881 Views
15 Pages

13 August 2024

Tracking control of the output probability density function presents significant challenges, particularly when dealing with unknown system models and multiplicative noise disturbances. To address these challenges, this paper introduces a novel tracki...

  • Article
  • Open Access
12 Citations
3,501 Views
17 Pages

Robust Attitude Control of an Agile Aircraft Using Improved Q-Learning

  • Mohsen Zahmatkesh,
  • Seyyed Ali Emami,
  • Afshin Banazadeh and
  • Paolo Castaldi

12 December 2022

Attitude control of a novel regional truss-braced wing (TBW) aircraft with low stability characteristics is addressed in this paper using Reinforcement Learning (RL). In recent years, RL has been increasingly employed in challenging applications, par...

  • Article
  • Open Access
7 Citations
3,016 Views
21 Pages

31 January 2023

The use of multiple mobile robots has grown significantly over the past few years in logistics, manufacturing and public services. Conflict–free route planning is one of the major research challenges for such mobile robots. Optimization methods...

  • Article
  • Open Access
9 Citations
3,808 Views
24 Pages

22 August 2022

This paper addresses the problem of detecting multiple static and mobile targets by an autonomous mobile agent acting under uncertainty. It is assumed that the agent is able to detect targets at different distances and that the detection includes err...

  • Article
  • Open Access
12 Citations
2,708 Views
10 Pages

6 October 2021

Conventional optimization-based relay selection for multihop networks cannot resolve the conflict between performance and cost. The optimal selection policy is centralized and requires local channel state information (CSI) of all hops, leading to hig...

  • Article
  • Open Access
7 Citations
2,935 Views
32 Pages

Enhancing the Efficiency of a Cybersecurity Operations Center Using Biomimetic Algorithms Empowered by Deep Q-Learning

  • Rodrigo Olivares,
  • Omar Salinas,
  • Camilo Ravelo,
  • Ricardo Soto and
  • Broderick Crawford

In the complex and dynamic landscape of cyber threats, organizations require sophisticated strategies for managing Cybersecurity Operations Centers and deploying Security Information and Event Management systems. Our study enhances these strategies b...

  • Article
  • Open Access
5 Citations
3,035 Views
17 Pages

13 October 2022

Deep reinforcement learning (DRL) algorithms interact with the environment and have achieved considerable success in several decision-making problems. However, DRL requires a significant number of data before it can achieve adequate performance. More...

  • Article
  • Open Access
8 Citations
2,495 Views
12 Pages

A Novel Relay Selection Scheme Based on Q-Learning in Multi-Hop Wireless Networks

  • Min-Jae Paek,
  • Yu-Jin Na,
  • Won-Seok Lee,
  • Jae-Hyun Ro and
  • Hyoung-Kyu Song

30 July 2020

In wireless communication systems, reliability, low latency and power are essential in large scale multi-hop environment. Multi-hop based cooperative communication is an efficient way to achieve goals of wireless networks. This paper proposes a relay...

  • Article
  • Open Access
31 Citations
8,465 Views
20 Pages

9 October 2015

In order to realize the online learning of a hybrid electric vehicle (HEV) control strategy, a fuzzy Q-learning (FQL) method is proposed in this paper. FQL control strategies consists of two parts: The optimal action-value function Q*(x,u) estimator...

  • Article
  • Open Access
3 Citations
1,987 Views
21 Pages

Efficient Jamming Policy Generation Method Based on Multi-Timescale Ensemble Q-Learning

  • Jialong Qian,
  • Qingsong Zhou,
  • Zhihui Li,
  • Zhongping Yang,
  • Shasha Shi,
  • Zhenjia Xu and
  • Qiyun Xu

27 August 2024

With the advancement of radar technology toward multifunctionality and cognitive capabilities, traditional radar countermeasures are no longer sufficient to meet the demands of countering the advanced multifunctional radar (MFR) systems. Rapid and ac...

  • Feature Paper
  • Article
  • Open Access
14 Citations
6,441 Views
32 Pages

Successful Pass Schedule Design in Open-Die Forging Using Double Deep Q-Learning

  • Niklas Reinisch,
  • Fridtjof Rudolph,
  • Stefan Günther,
  • David Bailly and
  • Gerhard Hirt

22 June 2021

In order to not only produce an open-die forged part with the desired final geometry but to also maintain economic production, precise process planning is necessary. However, due to the incremental forming of the billet, often with several hundred st...

  • Article
  • Open Access
11 Citations
5,464 Views
19 Pages

Traffic Light Cycle Configuration of Single Intersection Based on Modified Q-Learning

  • Hung-Chi Chu,
  • Yi-Xiang Liao,
  • Lin-huang Chang and
  • Yen-Hsi Lee

27 October 2019

In recent years, within large cities with a high population density, traffic congestion has become more and more serious, resulting in increased emissions of vehicles and reducing the efficiency of urban operations. Many factors have caused traffic c...

  • Article
  • Open Access
30 Citations
7,191 Views
16 Pages

Deep Q-Learning in Robotics: Improvement of Accuracy and Repeatability

  • Marius Sumanas,
  • Algirdas Petronis,
  • Vytautas Bucinskas,
  • Andrius Dzedzickis,
  • Darius Virzonis and
  • Inga Morkvenaite-Vilkonciene

21 May 2022

Recent industrial robotics covers a broad part of the manufacturing spectrum and other human everyday life applications; the performance of these devices has become increasingly important. Positioning accuracy and repeatability, as well as operating...

  • Article
  • Open Access
385 Views
19 Pages

24 December 2025

This study introduces a model-free reinforcement learning framework based on Q-Learning (QLA) for the multi-objective optimization of Selective Laser Melting (SLM) process parameters for Inconel 718. To efficiently handle the limited experimental dat...

  • Article
  • Open Access
7 Citations
2,549 Views
15 Pages

Evaluating Deep Q-Learning Algorithms for Controlling Blood Glucose in In Silico Type 1 Diabetes

  • Miguel Tejedor,
  • Sigurd Nordtveit Hjerde,
  • Jonas Nordhaug Myhre and
  • Fred Godtliebsen

7 October 2023

Patients with type 1 diabetes must continually decide how much insulin to inject before each meal to maintain blood glucose levels within a healthy range. Recent research has worked on a solution for this burden, showing the potential of reinforcemen...

  • Article
  • Open Access
4 Citations
1,614 Views
21 Pages

14 May 2024

This article investigates the optimal tracking control problem for data-based stochastic discrete-time linear systems. An average off-policy Q-learning algorithm is proposed to solve the optimal control problem with random disturbances. Compared with...

  • Article
  • Open Access
6 Citations
2,799 Views
19 Pages

Real-Time Data Transmission Scheduling Algorithm for Wireless Sensor Networks Based on Deep Q-Learning

  • Aiqi Zhang,
  • Meiyi Sun,
  • Jiaqi Wang,
  • Zhiyi Li,
  • Yanbo Cheng and
  • Cheng Wang

In the industrial environment, the data transmission of Wireless Sensor Networks (WSNs) usually has strict deadline requirements. Improving the reliability and real-time performance of data transmission has become one of the critical issues in WSNs r...

  • Article
  • Open Access
22 Citations
3,421 Views
11 Pages

17 July 2019

Navigation systems can help in allocating public charging stations to electric vehicles (EVs) with the aim of minimizing EVs’ charging time by integrating sufficient data. However, the existing systems only consider their travel time and transf...

  • Article
  • Open Access
4 Citations
2,379 Views
19 Pages

14 May 2023

Path planning in complex environments remains a challenging task for unmanned vehicles. In this paper, we propose a decoupled path-planning algorithm with the help of a deep reinforcement learning algorithm that separates the evaluation of paths from...

  • Article
  • Open Access
14 Citations
2,420 Views
23 Pages

19 January 2024

This study focuses on the scheduling problem of heterogeneous unmanned surface vehicles (USVs) with obstacle avoidance pretreatment. The goal is to minimize the overall maximum completion time of USVs. First, we develop a mathematical model for the p...

  • Article
  • Open Access
27 Citations
3,730 Views
18 Pages

An Optimal Scheduling Strategy of a Microgrid with V2G Based on Deep Q-Learning

  • Yuxin Wen,
  • Peixiao Fan,
  • Jia Hu,
  • Song Ke,
  • Fuzhang Wu and
  • Xu Zhu

19 August 2022

In recent years, the access of various distributed power sources and electric vehicles (EVs) has brought more and more randomness and uncertainty to the operation and regulation of microgrids. Therefore, an optimal scheduling strategy for microgrids...

of 42