RLlib flow: distributed reinforcement learning is a dataflow problem

Y Li - arXiv preprint arXiv:2202.11296, 2022 - arxiv.org

This article is a gentle discussion about the field of reinforcement learning in practice, about
opportunities and challenges, touching a broad range of topics, with perspectives and …

被引用次数：27 相关文章所有 2 个版本

[PDF] usenix.org

{MSRL}: Distributed Reinforcement Learning with Dataflow Fragments

H Zhu, B Zhao, G Chen, W Chen, Y Chen… - 2023 USENIX Annual …, 2023 - usenix.org

A wide range of reinforcement learning (RL) algorithms have been proposed, in which
agents learn from interactions with a simulated environment. Executing such RL training …

被引用次数：9 相关文章所有 12 个版本

[PDF] acm.org

Efficient parallel reinforcement learning framework using the reactor model

J Kwok, M Lohstroh, EA Lee - Proceedings of the 36th ACM Symposium …, 2024 - dl.acm.org

Parallel Reinforcement Learning (RL) frameworks are essential for mapping RL workloads
to multiple computational resources, allowing for faster generation of samples, estimation of …

被引用次数：2 相关文章

[PDF] arxiv.org

Hybridflow: A flexible and efficient rlhf framework

G Sheng, C Zhang, Z Ye, X Wu, W Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org

Reinforcement Learning from Human Feedback (RLHF) is widely used in Large Language
Model (LLM) alignment. Traditional RL can be modeled as a dataflow, where each node …

被引用次数：1 相关文章所有 3 个版本

[PDF] acm.org

An empirical evaluation of flow based programming in the machine learning deployment context

A Paleyes, C Cabrera, ND Lawrence - Proceedings of the 1st …, 2022 - dl.acm.org

As use of data driven technologies spreads, software engineers are more often faced with
the task of solving a business problem using data-driven methods such as machine learning …

被引用次数：8 相关文章所有 7 个版本

[PDF] arxiv.org

AI-coupled HPC Workflow Applications, Middleware and Performance

W Brewer, A Gainaru, F Suter, F Wang… - arXiv preprint arXiv …, 2024 - arxiv.org

AI integration is revolutionizing the landscape of HPC simulations, enhancing the
importance, use, and performance of AI-driven HPC workflows. This paper surveys the …

被引用次数：7 相关文章

[PDF] zhenxiao.com

Optimizing communication in deep reinforcement learning with XingTian

L Pan, J Qian, W Xia, H Mao, J Yao, P Li… - Proceedings of the 23rd …, 2022 - dl.acm.org

Deep Reinforcement Learning (DRL) achieves great success in various domains.
Communication in today's DRL algorithms takes non-negligible time compared to the …

被引用次数：1 相关文章所有 2 个版本

[PDF] openproceedings.org

[PDF][PDF] Distributed Training of Knowledge Graph Embedding Models using Ray.

N Sheikh, X Qin, Y Gur, B Reinwald, Q Xiang, H Yu - EDBT, 2022 - openproceedings.org

Knowledge graphs are at the core of numerous consumer and enterprise applications where
learned graph embeddings are used to derive insights for the users of these applications …

被引用次数：3 相关文章所有 3 个版本

SCDRL: Scalable and Customized Distributed Reinforcement Learning System

C Sun, W Qiang, J Li - 2024 IEEE 44th International …, 2024 - ieeexplore.ieee.org

Reinforcement Learning (RL) has marked significant achievements across a variety of
complex tasks in real-world scenarios. However, the efficacy of RL predominantly relies on …

Learning human like driving policies from real interactive driving scenes

Y Koeberle, S Sabatini, D Tsishkou, C Sabourin - 2022 - hal.science

Traffic simulation has gained a lot of interest for massive safety evaluation of self-driving
systems in a risk free setting but the reality gap remains a big challenge. Adversarial …

被引用次数：3 相关文章所有 22 个版本