Graph attention mechanism based reinforcement learning for multi-agent flocking control in communication-restricted environment
J Xiao, G Yuan, J He, K Fang, Z Wang - Information Sciences, 2023 - Elsevier
To solve the poor performance of reinforcement learning (RL) in the multi-agent flocking
cooperative control under the communication-restricted environments, we propose a multi …
cooperative control under the communication-restricted environments, we propose a multi …
Safe reinforcement learning under temporal logic with reward design and quantum action selection
This paper proposes an advanced Reinforcement Learning (RL) method, incorporating
reward-shaping, safety value functions, and a quantum action selection algorithm. The …
reward-shaping, safety value functions, and a quantum action selection algorithm. The …
[HTML][HTML] Actor-critic continuous state reinforcement learning for wind-turbine control robust optimization
B Fernandez-Gauna, M Graña, JL Osa-Amilibia… - Information …, 2022 - Elsevier
Abstract The control of Variable-Speed Wind-Turbines (VSWT) extracting electrical power
from the wind kinetic energy are composed of subsystems that need to be controlled jointly …
from the wind kinetic energy are composed of subsystems that need to be controlled jointly …
Safety robustness of reinforcement learning policies: A view from robust control
For a reinforcement learning (RL) problem without a specified reward function, one may
specify different reward functions to better guide an agent to learn. With different reward …
specify different reward functions to better guide an agent to learn. With different reward …
Autonomous flipper control with safety constraints
M Pecka, V Šalanský, K Zimmermann… - 2016 IEEE/RSJ …, 2016 - ieeexplore.ieee.org
Policy Gradient methods require many real-world trials. Some of the trials may endanger the
robot system and cause its rapid wear. Therefore, a safe or at least gentle-to-wear …
robot system and cause its rapid wear. Therefore, a safe or at least gentle-to-wear …
Online fuzzy modulated adaptive PD control for cooperative aerial transportation of deformable linear objects
The aim of this work is to design robust control algorithms of aerial robots, ie quadrotors, for
team transportation of a deformable linear object (DLO). The DLO-robot attachment makes …
team transportation of a deformable linear object (DLO). The DLO-robot attachment makes …
Particle swarm optimization quadrotor control for cooperative aerial transportation of deformable linear objects
We present a cooperative aerial robot system for the transportation of hoses. The hose–
robot attachment makes the whole system physically interconnected but not rigid, so that …
robot attachment makes the whole system physically interconnected but not rigid, so that …
Modular neural network via exploring category hierarchy
Modular is a powerful and inherently hierarchical concept in the human brain to process a
large variety of complex tasks. Converging evidence has shown several advantages to …
large variety of complex tasks. Converging evidence has shown several advantages to …
Decentralized opportunistic spectrum resources access model and algorithm toward cooperative ad-hoc networks
Limited communication resources have gradually become a critical factor toward efficiency
of decentralized large scale multi-agent coordination when both system scales up and tasks …
of decentralized large scale multi-agent coordination when both system scales up and tasks …
Safe reward‐based deep reinforcement learning control for an electro‐hydraulic servo system
M Wu, L Liu, Z Yu, W Li - International Journal of Robust and …, 2022 - Wiley Online Library
In this article, a safe deep reinforcement learning (DRL) control method based on a safe
reward shaping method is proposed and applied to the constrained control for an electro …
reward shaping method is proposed and applied to the constrained control for an electro …