- Article
Segmenting Action-Value Functions over Time Scales in SARSA via TD(Δ)
- Mahammad Humayoo,
- Gengzhong Zheng,
- Xiaoqing Dong,
- Wei Huang,
- Liming Miao,
- Shuwei Qiu,
- Zexun Zhou,
- Peitao Wang,
- Zakir Ullah and
- Xueqi Cheng
- + 1 author
In numerous episodic reinforcement learning (RL) environments, SARSA-based methodologies are employed to enhance policies aimed at maximizing returns over long horizons. Traditional SARSA algorithms face challenges in achieving an optimal balance bet...

