A survey on model-based reinforcement learning

FM Luo, T Xu, H Lai, XH Chen, W Zhang… - Science China Information …, 2024 - Springer
Reinforcement learning (RL) interacts with the environment to solve sequential decision-making
problems via a trial-and-error approach. Errors are always undesirable in real-world …

Bootstrapped transformer for offline reinforcement learning

K Wang, H Zhao, X Luo, K Ren… - Advances in Neural …, 2022 - proceedings.neurips.cc
Offline reinforcement learning (RL) aims at learning policies from previously collected static
trajectory data without interacting with the real environment. Recent works provide a novel …

Understanding world models through multi-step pruning policy via reinforcement learning

Z He, W Qiu, W Zhao, X Shao, Z Liu - Information Sciences, 2025 - Elsevier
In model-based reinforcement learning, the conventional approach to addressing world
model bias is to use gradient optimization methods. However, using a singular policy from …

Empirical prior based probabilistic inference neural network for policy learning

Y Li, S Guo, Z Gan - Information Sciences, 2022 - Elsevier
Reinforcement learning is very much democratized for autonomous control of an unknown
dynamics system. However, low data efficiency is a practical concern in physical systems …

Automated cryptocurrency trading bot implementing DRL

A Peng, SL Ang, CY Lim - Pertanika Journal of Science and …, 2022 - pertanika2.upm.edu.my
A year ago, one thousand USD invested in Bitcoin (BTC) alone would have
appreciated to three thousand five hundred USD. Deep reinforcement learning (DRL) recent …

RITA: Boost driving simulators with realistic interactive traffic flow

Z Zhu, S Zhang, Y Zhuang, Y Liu, M Liu… - Proceedings of the Fifth …, 2023 - dl.acm.org
High-quality traffic flow generation is the core module in building simulators for autonomous
driving. However, the majority of available simulators are incapable of replicating traffic …

Implicit Safe Set Algorithm for Provably Safe Reinforcement Learning

W Zhao, T He, F Li, C Liu - arXiv preprint arXiv:2405.02754, 2024 - arxiv.org
Deep reinforcement learning (DRL) has demonstrated remarkable performance in many
continuous control tasks. However, a significant obstacle to the real-world application of …

RITA: Boost Autonomous Driving Simulators with Realistic Interactive Traffic Flow

Z Zhu, S Zhang, Y Zhuang, Y Liu, M Liu, L Mao… - arXiv preprint arXiv …, 2022 - arxiv.org
High-quality traffic flow generation is the core module in building simulators for autonomous
driving. However, the majority of available simulators are incapable of replicating traffic …

Adversarially Trained Environment Models Are Effective Policy Evaluators and Improvers - An Application to Information Retrieval

Y Li, Y Liu, X Dai, J Lin, H Lai, Y Liu, Y Yu - Proceedings of the Fifth …, 2023 - dl.acm.org
The essence of information retrieval (IR) is to find the most useful information items (or
documents) according to the user's information need and present the items to the users in …

Modelling Pedestrians in Autonomous Vehicle Testing

M Priisalu - 2023 - portal.research.lu.se
Realistic modelling of pedestrians in Autonomous Vehicles (AVs) and AV testing is crucial
to avoid lethal collisions in deployment. The majority of AV trajectory forecasting literature do …