Neural Laplace control for continuous-time delayed systems

S Holt, A Hüyük, Z Qian, H Sun… - International …, 2023 - proceedings.mlr.press
Many real-world offline reinforcement learning (RL) problems involve continuous-time
environments with delays. Such environments are characterized by two distinctive features …

Boosting Long-Delayed Reinforcement Learning with Auxiliary Short-Delayed Task

Q Wu, SS Zhan, Y Wang, CW Lin, C Lv, Q Zhu… - arXiv preprint arXiv …, 2024 - arxiv.org
Reinforcement learning is challenging in delayed scenarios, a common real-world situation
where observations and interactions occur with delays. State-of-the-art (SOTA) state …

Delays in reinforcement learning

P Liotet - arXiv preprint arXiv:2309.11096, 2023 - arxiv.org
Delays are inherent to most dynamical systems. Besides shifting the process in time, they
can significantly affect their performance. For this reason, it is usually valuable to study the …

A delay-robust method for enhanced real-time reinforcement learning

B Xia, H Sun, B Yuan, Z Li, B Liang, X Wang - Neural Networks, 2024 - Elsevier
In reinforcement learning, the Markov Decision Process (MDP) framework typically operates
under a blocking paradigm, assuming a static environment during the agent's decision …

[HTML][HTML] A pipelining task offloading strategy via delay-aware multi-agent reinforcement learning in Cybertwin-enabled 6G network

H Niu, L Wang, K Du, Z Lu, X Wen, Y Liu - Digital Communications and …, 2023 - Elsevier
Abstract Cybertwin-enabled 6th Generation (6G) network is envisioned to support artificial
intelligence-native management to meet changing demands of 6G applications. Multi-Agent …

Variational Delayed Policy Optimization

Q Wu, SS Zhan, Y Wang, Y Wang, CW Lin, C Lv… - arXiv preprint arXiv …, 2024 - arxiv.org
In environments with delayed observation, state augmentation by including actions within
the delay window is adopted to retrieve Markovian property to enable reinforcement learning …

Adaptive PD Control Using Deep Reinforcement Learning for Local-Remote Teleoperation with Stochastic Time Delays

L McCutcheon, S Fallah - 2023 IEEE/RSJ International …, 2023 - ieeexplore.ieee.org
Local-remote systems allow robots to execute complex tasks in hazardous environments
such as space and nuclear power stations. However, establishing accurate positional …

Overcoming Delayed Feedback via Overlook Decision Making

YL Yu, B Xia, M Xie, X Wang, Z Li… - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
Reinforcement learning is one of the most general paradigms to solve sequential decision
making issues on the assumption that the action selection and environmental feedback are …

Dynamic Modeling for Reinforcement Learning with Random Delay

Y Yu, B xia, M Xie, Z Li, X Wang - International Conference on Artificial …, 2024 - Springer
Delays in real-world tasks degrade the performance of standard reinforcement learning (RL)
which is based on the assumption that environmental feedback and action selection are …

Boosting Reinforcement Learning with Strongly Delayed Feedback Through Auxiliary Short Delays

Q Wu, SS Zhan, Y Wang, Y Wang, CW Lin, C Lv… - Forty-first International … - openreview.net
Reinforcement learning (RL) is challenging in the common case of delays between events
and their sensory perceptions. State-of-the-art (SOTA) state augmentation techniques either …