Safe RLHF: Safe reinforcement learning from human feedback

J Dai, X Pan, R Sun, J Ji, X Xu, M Liu, Y Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
With the development of large language models (LLMs), striking a balance between the
performance and safety of AI systems has never been more critical. However, the inherent …

Is Sora a world simulator? A comprehensive survey on general world models and beyond

Z Zhu, X Wang, W Zhao, C Min, N Deng, M Dou… - arXiv preprint arXiv …, 2024 - arxiv.org
General world models represent a crucial pathway toward achieving Artificial General
Intelligence (AGI), serving as the cornerstone for various applications ranging from virtual …

World models for autonomous driving: An initial survey

Y Guan, H Liao, Z Li, J Hu, R Yuan, Y Li… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
In the rapidly evolving landscape of autonomous driving, the capability to accurately predict
future events and assess their implications is paramount for both safety and efficiency …

Robust Training of Federated Models with Extremely Label Deficiency

Y Zhang, Z Yang, X Tian, N Wang, T Liu… - arXiv preprint arXiv …, 2024 - arxiv.org
Federated semi-supervised learning (FSSL) has emerged as a powerful paradigm for
collaboratively training machine learning models using distributed data with label deficiency …

Feasibility Consistent Representation Learning for Safe Reinforcement Learning

Z Cen, Y Yao, Z Liu, D Zhao - arXiv preprint arXiv:2405.11718, 2024 - arxiv.org
In the field of safe reinforcement learning (RL), finding a balance between satisfying safety
constraints and optimizing reward performance presents a significant challenge. A key …

OASIS: Conditional Distribution Shaping for Offline Safe Reinforcement Learning

Y Yao, Z Cen, W Ding, H Lin, S Liu, T Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
Offline safe reinforcement learning (RL) aims to train a policy that satisfies constraints using
a pre-collected dataset. Most current methods struggle with the mismatch between imperfect …

Dynamic Model Predictive Shielding for Provably Safe Reinforcement Learning

A Banerjee, K Rahmani, J Biswas, I Dillig - arXiv preprint arXiv …, 2024 - arxiv.org
Among approaches for provably safe reinforcement learning, Model Predictive Shielding
(MPS) has proven effective at complex tasks in continuous, high-dimensional state spaces …

FOSP: Fine-tuning Offline Safe Policy through World Models

C Cao, Y Xin, S Wu, L He, Z Yan, J Tan… - arXiv preprint arXiv …, 2024 - arxiv.org
Model-based Reinforcement Learning (RL) has demonstrated high training efficiency and the
capability to handle high-dimensional tasks. Regarding safety issues, safe model-based …

Leveraging Approximate Model-based Shielding for Probabilistic Safety Guarantees in Continuous Environments

AW Goodall, F Belardinelli - arXiv preprint arXiv:2402.00816, 2024 - arxiv.org
Shielding is a popular technique for achieving safe reinforcement learning (RL). However,
classical shielding approaches come with quite restrictive assumptions, making them difficult …

Evaluation of DreamerV3 for defective plant search in a simulated crop

T Bonte - 2024 - edepot.wur.nl
The use of autonomous robots is increasing in agriculture; however, their development is
still slow, as each task usually needs a specific hard-coded behaviour. Hard coding of all …