Exploration and exploitation balance management in fuzzy reinforcement learning
This paper proposes a fuzzy scheme for managing the balance between exploration and
exploitation, which can be implemented in any critic-only fuzzy reinforcement learning
method. The paper focuses, however, on a recently developed continuous reinforcement
learning method, fuzzy Sarsa learning (FSL), because of its advantages. Establishing
this balance depends greatly on the accuracy of the action value function approximation. First, the
overfitting problem in approximating the action value function in continuous reinforcement …