Reinforcement learning in practice: Opportunities and challenges

Y Li - arXiv preprint arXiv:2202.11296, 2022 - arxiv.org
This article is a gentle discussion about the field of reinforcement learning in practice, about
opportunities and challenges, touching a broad range of topics, with perspectives and …

{MSRL}: Distributed Reinforcement Learning with Dataflow Fragments

H Zhu, B Zhao, G Chen, W Chen, Y Chen… - 2023 USENIX Annual …, 2023 - usenix.org
A wide range of reinforcement learning (RL) algorithms have been proposed, in which
agents learn from interactions with a simulated environment. Executing such RL training …

Efficient parallel reinforcement learning framework using the reactor model

J Kwok, M Lohstroh, EA Lee - Proceedings of the 36th ACM Symposium …, 2024 - dl.acm.org
Parallel Reinforcement Learning (RL) frameworks are essential for mapping RL workloads
to multiple computational resources, allowing for faster generation of samples, estimation of …

Hybridflow: A flexible and efficient rlhf framework

G Sheng, C Zhang, Z Ye, X Wu, W Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
Reinforcement Learning from Human Feedback (RLHF) is widely used in Large Language
Model (LLM) alignment. Traditional RL can be modeled as a dataflow, where each node …

An empirical evaluation of flow based programming in the machine learning deployment context

A Paleyes, C Cabrera, ND Lawrence - Proceedings of the 1st …, 2022 - dl.acm.org
As use of data driven technologies spreads, software engineers are more often faced with
the task of solving a business problem using data-driven methods such as machine learning …

AI-coupled HPC Workflow Applications, Middleware and Performance

W Brewer, A Gainaru, F Suter, F Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
AI integration is revolutionizing the landscape of HPC simulations, enhancing the
importance, use, and performance of AI-driven HPC workflows. This paper surveys the …

Optimizing communication in deep reinforcement learning with XingTian

L Pan, J Qian, W Xia, H Mao, J Yao, P Li… - Proceedings of the 23rd …, 2022 - dl.acm.org
Deep Reinforcement Learning (DRL) achieves great success in various domains.
Communication in today's DRL algorithms takes non-negligible time compared to the …

[PDF][PDF] Distributed Training of Knowledge Graph Embedding Models using Ray.

N Sheikh, X Qin, Y Gur, B Reinwald, Q Xiang, H Yu - EDBT, 2022 - openproceedings.org
Knowledge graphs are at the core of numerous consumer and enterprise applications where
learned graph embeddings are used to derive insights for the users of these applications …

SCDRL: Scalable and Customized Distributed Reinforcement Learning System

C Sun, W Qiang, J Li - 2024 IEEE 44th International …, 2024 - ieeexplore.ieee.org
Reinforcement Learning (RL) has marked significant achievements across a variety of
complex tasks in real-world scenarios. However, the efficacy of RL predominantly relies on …

Learning human like driving policies from real interactive driving scenes

Y Koeberle, S Sabatini, D Tsishkou, C Sabourin - 2022 - hal.science
Traffic simulation has gained a lot of interest for massive safety evaluation of self-driving
systems in a risk free setting but the reality gap remains a big challenge. Adversarial …