Toward theoretical understandings of robust markov decision processes: Sample complexity and asymptotics W Yang, L Zhang, Z Zhang The Annals of Statistics 50 (6), 3223-3248, 2022 | 58 | 2022 |
Semi-Infinitely Constrained Markov Decision Processes and Provably Efficient Reinforcement Learning L Zhang, Y Peng, W Yang, Z Zhang IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024 | 3* | 2024 |
Estimation and Inference in Distributional Reinforcement Learning L Zhang, Y Peng, J Liang, W Yang, Z Zhang arXiv preprint arXiv:2309.17262, 2023 | 1 | 2023 |
Statistical Estimation of Confounded Linear MDPs: An Instrumental Variable Approach M Lu, W Yang, L Zhang, Z Zhang arXiv preprint arXiv:2209.05186, 2022 | 1 | 2022 |
Intervention Generative Adversarial Networks J Liang, L Zhang, C Zhang, Z Zhang arXiv preprint arXiv:2008.03712, 2020 | 1 | 2020 |
Federated Control in Markov Decision Processes H Jin, Y Peng, L Zhang, Z Zhang arXiv preprint arXiv:2405.04026, 2024 | | 2024 |
Federated Reinforcement Learning with Constraint Heterogeneity H Jin, L Zhang, Z Zhang arXiv preprint arXiv:2405.03236, 2024 | | 2024 |
Statistical Efficiency of Distributional Temporal Difference Y Peng, L Zhang, Z Zhang arXiv preprint arXiv:2403.05811, 2024 | | 2024 |