Knowledge consistency between neural networks and beyond R Liang, T Li, LF Li, J Wang, Q Zhang The Eighth International Conference on Learning Representations (ICLR), 2020 | 38 | 2020 |
Dynamic regret of online Markov decision processes P Zhao, LF Li, ZH Zhou The 39th International Conference on Machine Learning (ICML), 26865-26894, 2022 | 15 | 2022 |
Dynamic Regret of Adversarial Linear Mixture MDPs LF Li, P Zhao, ZH Zhou Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS), 2023 | 4 | 2023 |
Improved Algorithm for Adversarial Linear Mixture MDPs with Bandit Feedback and Unknown Transition LF Li, P Zhao, ZH Zhou The 27th International Conference on Artificial Intelligence and Statistics …, 2024 | 1 | 2024 |
Tracking treatment effect heterogeneity in evolving environments T Qin, LF Li, TZ Wang, ZH Zhou Machine Learning, 1-21, 2024 | 1 | 2024 |
Dynamic Regret of Adversarial MDPs with Unknown Transition and Linear Function Approximation LF Li, P Zhao, ZH Zhou The 38th Annual AAAI Conference on Artificial Intelligence (AAAI), 2024 | 1 | 2024 |
Provably Efficient Reinforcement Learning with Multinomial Logit Function Approximation LF Li, YJ Zhang, P Zhao, ZH Zhou arXiv preprint arXiv:2405.17061, 2024 | | 2024 |