A survey on model-based reinforcement learning

FM Luo, T Xu, H Lai, XH Chen, W Zhang… - Science China Information …, 2024 - Springer
Reinforcement learning (RL) interacts with the environment to solve sequential decision-
making problems via a trial-and-error approach. Errors are always undesirable in real-world …

Error bounds of imitating policies and environments

T Xu, Z Li, Y Yu - Advances in Neural Information …, 2020 - proceedings.neurips.cc
Imitation learning trains a policy by mimicking expert demonstrations. Various imitation
methods were proposed and empirically evaluated, meanwhile, their theoretical …

Maximum-likelihood inverse reinforcement learning with finite-time guarantees

S Zeng, C Li, A Garcia, M Hong - Advances in Neural …, 2022 - proceedings.neurips.cc
Inverse reinforcement learning (IRL) aims to recover the reward function and the associated
optimal policy that best fits observed sequences of states and actions implemented by an …

Self-directed online machine learning for topology optimization

C Deng, Y Wang, C Qin, Y Fu, W Lu - Nature communications, 2022 - nature.com
Topology optimization by optimally distributing materials in a given domain requires non-
gradient optimizers to solve highly complicated problems. However, with hundreds of design …

Learning from demonstration: Provably efficient adversarial policy imitation with linear function approximation

Z Liu, Y Zhang, Z Fu, Z Yang… - … conference on machine …, 2022 - proceedings.mlr.press
In generative adversarial imitation learning (GAIL), the agent aims to learn a policy from an
expert demonstration so that its performance cannot be discriminated from the expert policy …

Making reconstruction-based method great again for video anomaly detection

Y Wang, C Qin, Y Bai, Y Xu, X Ma… - 2022 IEEE International …, 2022 - ieeexplore.ieee.org
Anomaly detection in videos is a significant yet challenging problem. Previous approaches
based on deep neural networks employ either reconstruction-based or prediction-based …

Diffusion model-augmented behavioral cloning

SF Chen, HC Wang, MH Hsu, CM Lai… - arXiv preprint arXiv …, 2023 - arxiv.org
Imitation learning addresses the challenge of learning by observing an expert's
demonstrations without access to reward signals from environments. Most existing imitation …

Proximal point imitation learning

L Viano, A Kamoutsi, G Neu… - Advances in Neural …, 2022 - proceedings.neurips.cc
This work develops new algorithms with rigorous efficiency guarantees for infinite horizon
imitation learning (IL) with linear function approximation without restrictive coherence …

Error bounds of imitating policies and environments for reinforcement learning

T Xu, Z Li, Y Yu - IEEE Transactions on Pattern Analysis and …, 2021 - ieeexplore.ieee.org
In sequential decision-making, imitation learning (IL) trains a policy efficiently by mimicking
expert demonstrations. Various imitation methods were proposed and empirically evaluated …

When demonstrations meet generative world models: A maximum likelihood framework for offline inverse reinforcement learning

S Zeng, C Li, A Garcia, M Hong - Advances in Neural …, 2023 - proceedings.neurips.cc
Offline inverse reinforcement learning (Offline IRL) aims to recover the structure of rewards
and environment dynamics that underlie observed actions in a fixed, finite set of …