Learning to build high-fidelity and robust environment models

FM Luo, T Xu, H Lai, XH Chen, W Zhang… - Science China Information …, 2024 - Springer

Reinforcement learning (RL) interacts with the environment to solve sequential decision-
making problems via a trial-and-error approach. Errors are always undesirable in real-world …

被引用次数：102 相关文章所有 4 个版本

[PDF] neurips.cc

Bootstrapped transformer for offline reinforcement learning

K Wang, H Zhao, X Luo, K Ren… - Advances in Neural …, 2022 - proceedings.neurips.cc

Offline reinforcement learning (RL) aims at learning policies from previously collected static
trajectory data without interacting with the real environment. Recent works provide a novel …

被引用次数：51 相关文章所有 7 个版本

Understanding world models through multi-step pruning policy via reinforcement learning

Z He, W Qiu, W Zhao, X Shao, Z Liu - Information Sciences, 2025 - Elsevier

In model-based reinforcement learning, the conventional approach to addressing world
model bias is to use gradient optimization methods. However, using a singular policy from …

被引用次数：1 相关文章所有 2 个版本

Empirical prior based probabilistic inference neural network for policy learning

Y Li, S Guo, Z Gan - Information Sciences, 2022 - Elsevier

Reinforcement learning is very much democratized for autonomous control of an unknown
dynamics system. However, low data efficiency is a practical concern in physical systems …

被引用次数：5 相关文章所有 2 个版本

[PDF] upm.edu.my

[PDF][PDF] Automated cryptocurrency trading bot implementing DRL

A Peng, SL Ang, CY Lim - Pertanika Journal of Science and …, 2022 - pertanika2.upm.edu.my

ABSTRACT A year ago, one thousand USD invested in Bitcoin (BTC) alone would have
appreciated to three thousand five hundred USD. Deep reinforcement learning (DRL) recent …

被引用次数：8 相关文章所有 5 个版本

RITA: Boost driving simulators with realistic interactive traffic flow

Z Zhu, S Zhang, Y Zhuang, Y Liu, M Liu… - Proceedings of the Fifth …, 2023 - dl.acm.org

High-quality traffic flow generation is the core module in building simulators for autonomous
driving. However, the majority of available simulators are incapable of replicating traffic …

被引用次数：2 相关文章所有 2 个版本

[PDF] arxiv.org

Implicit Safe Set Algorithm for Provably Safe Reinforcement Learning

W Zhao, T He, F Li, C Liu - arXiv preprint arXiv:2405.02754, 2024 - arxiv.org

Deep reinforcement learning (DRL) has demonstrated remarkable performance in many
continuous control tasks. However, a significant obstacle to the real-world application of …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

RITA: Boost Autonomous Driving Simulators with Realistic Interactive Traffic Flow

Z Zhu, S Zhang, Y Zhuang, Y Liu, M Liu, L Mao… - arXiv preprint arXiv …, 2022 - arxiv.org

High-quality traffic flow generation is the core module in building simulators for autonomous
driving. However, the majority of available simulators are incapable of replicating traffic …

被引用次数：3 相关文章所有 2 个版本

Adversarially Trained Environment Models Are Effective Policy Evaluators and Improvers-An Application to Information Retrieval

Y Li, Y Liu, X Dai, J Lin, H Lai, Y Liu, Y Yu - Proceedings of the Fifth …, 2023 - dl.acm.org

The essence of information retrieval (IR) is to find the most useful information items (or
documents) according to the user's information need and present the items to the users in …

Modelling Pedestrians in Autonomous Vehicle Testing

M Priisalu - 2023 - portal.research.lu.se

Realistic modelling of pedestrians in Autonomous Vehicles (AV) s and AV testing is crucial
to avoid lethal collisions in deployment. The majority of AV trajectory forecasting literature do …