Backward Q-learning: The combination of Sarsa algorithm and Q-learning

YH Wang, THS Li, CJ Lin - Engineering Applications of Artificial Intelligence, 2013 - Elsevier
Reinforcement learning (RL) has been applied to many fields and applications, but there are
still some dilemmas between exploration and exploitation strategy for action selection policy …

A review of research on reinforcement learning algorithms for multi-agents

K Hu, M Li, Z Song, K Xu, Q Xia, N Sun, P Zhou, M Xia - Neurocomputing, 2024 - Elsevier
In recent years, multi-agent reinforcement learning techniques have been widely used and
evolved in the field of artificial intelligence. However, traditional reinforcement learning …

Traffic flow management of autonomous vehicles using deep reinforcement learning and smart rerouting

A Mushtaq, IU Haq, MU Imtiaz, A Khan, O Shafiq - IEEE Access, 2021 - ieeexplore.ieee.org
Autonomous Vehicles (AVs) promise to disrupt the traditional systems of transportation. An
autonomous driving environment requires an uninterrupted, continuous stream of data and …

Supervised fuzzy reinforcement learning for robot navigation

F Fathinezhad, V Derhami, M Rezaeian - Applied Soft Computing, 2016 - Elsevier
This paper addresses a new method for combination of supervised learning and
reinforcement learning (RL). Applying supervised learning in robot navigation encounters …

Energy Hub optimal sizing in the smart grid; machine learning approach

A Sheikhi, M Rayati, AM Ranjbar - 2015 IEEE Power & Energy …, 2015 - ieeexplore.ieee.org
The interests in “Energy Hub”(EH) and “Smart Grid”(SG) concepts have been increasing, in
recent years. The synergy effect of the coupling between electricity and natural gas grids …

Generalizing fuzzy SARSA learning for real-time operation of irrigation canals

K Shahverdi, JM Maestre, F Alamiyan-Harandi, X Tian - Water, 2020 - mdpi.com
Recently, a continuous reinforcement learning model called fuzzy SARSA (state, action,
reward, state, action) learning (FSL) was proposed for irrigation canals. The main problem …

Synthesized design of a fuzzy logic controller for an underactuated unicycle

JX Xu, ZQ Guo, TH Lee - Fuzzy Sets and Systems, 2012 - Elsevier
In this paper, we propose synthesized design of a fuzzy logic controller (FLC) for control of
an underactuated unicycle system. The FLC objective is velocity control of the wheel while …

Towards learning behavior modeling of military logistics agent utilizing profit sharing reinforcement learning algorithm

X Li, W Pu, X Zhao - Applied Soft Computing, 2021 - Elsevier
Agent-based modeling has become a beneficial tool in describing the complex and
intelligent decision-making behaviors of military logistics entities, which is essential in …

A wavelet-based robust adaptive TS fuzzy controller design for synchronization of faulty chaotic gyrostat systems

Y Farid, A Ramezani - Journal of Control, Automation and Electrical …, 2021 - Springer
In this study, the synchronization problem of the faulty chaotic gyrostat master–slave systems
using a wavelet-based robust adaptive TS fuzzy control is investigated. This control system …

多智能体强化学习算法研究综述.

李明阳, 许可儿, 宋志强, 夏庆锋… - Journal of Frontiers of …, 2024 - search.ebscohost.com
近年来, 多智能体强化学习算法技术已广泛应用于人工智能领域. 系统性地分析了多智能体强化
学习算法, 审视了其在多智能体系统中的应用与进展, 并深入调研了相关研究成果 …