Reinforcement learning endowed with safe veto policies to learn the control of Linked-Multicompon...

Graph attention mechanism based reinforcement learning for multi-agent flocking control in communication-restricted environment

J Xiao, G Yuan, J He, K Fang, Z Wang - Information Sciences, 2023 - Elsevier

To solve the poor performance of reinforcement learning (RL) in the multi-agent flocking
cooperative control under the communication-restricted environments, we propose a multi …

被引用次数：23 相关文章所有 2 个版本

[PDF] nature.com

Safe reinforcement learning under temporal logic with reward design and quantum action selection

M Cai, S Xiao, J Li, Z Kan - Scientific reports, 2023 - nature.com

This paper proposes an advanced Reinforcement Learning (RL) method, incorporating
reward-shaping, safety value functions, and a quantum action selection algorithm. The …

被引用次数：15 相关文章所有 6 个版本

[HTML] sciencedirect.com

[HTML][HTML] Actor-critic continuous state reinforcement learning for wind-turbine control robust optimization

B Fernandez-Gauna, M Graña, JL Osa-Amilibia… - Information …, 2022 - Elsevier

Abstract The control of Variable-Speed Wind-Turbines (VSWT) extracting electrical power
from the wind kinetic energy are composed of subsystems that need to be controlled jointly …

被引用次数：27 相关文章所有 3 个版本

Safety robustness of reinforcement learning policies: A view from robust control

H Xiong, X Diao - Neurocomputing, 2021 - Elsevier

For a reinforcement learning (RL) problem without a specified reward function, one may
specify different reward functions to better guide an agent to learn. With different reward …

被引用次数：19 相关文章

[PDF] cvut.cz

Autonomous flipper control with safety constraints

M Pecka, V Šalanský, K Zimmermann… - 2016 IEEE/RSJ …, 2016 - ieeexplore.ieee.org

Policy Gradient methods require many real-world trials. Some of the trials may endanger the
robot system and cause its rapid wear. Therefore, a safe or at least gentle-to-wear …

被引用次数：30 相关文章所有 4 个版本

[PDF] researchgate.net

Online fuzzy modulated adaptive PD control for cooperative aerial transportation of deformable linear objects

J Estevez, M Graña… - Integrated Computer …, 2017 - content.iospress.com

The aim of this work is to design robust control algorithms of aerial robots, ie quadrotors, for
team transportation of a deformable linear object (DLO). The DLO-robot attachment makes …

被引用次数：24 相关文章所有 3 个版本

Particle swarm optimization quadrotor control for cooperative aerial transportation of deformable linear objects

J Estevez, JM Lopez-Guede, M Graña - Cybernetics and Systems, 2016 - Taylor & Francis

We present a cooperative aerial robot system for the transportation of hoses. The hose–
robot attachment makes the whole system physically interconnected but not rigid, so that …

被引用次数：23 相关文章所有 3 个版本

[PDF] github.io

Modular neural network via exploring category hierarchy

W Han, C Zheng, R Zhang, J Guo, Q Yang, J Shao - Information Sciences, 2021 - Elsevier

Modular is a powerful and inherently hierarchical concept in the human brain to process a
large variety of complex tasks. Converging evidence has shown several advantages to …

被引用次数：6 相关文章所有 3 个版本

[PDF] plos.org

Decentralized opportunistic spectrum resources access model and algorithm toward cooperative ad-hoc networks

M Liu, Y Xu, AW Mohammed - PloS one, 2016 - journals.plos.org

Limited communication resources have gradually become a critical factor toward efficiency
of decentralized large scale multi-agent coordination when both system scales up and tasks …

被引用次数：15 相关文章所有 12 个版本

Safe reward‐based deep reinforcement learning control for an electro‐hydraulic servo system

M Wu, L Liu, Z Yu, W Li - International Journal of Robust and …, 2022 - Wiley Online Library

In this article, a safe deep reinforcement learning (DRL) control method based on a safe
reward shaping method is proposed and applied to the constrained control for an electro …

被引用次数：1 相关文章所有 2 个版本