Who is the strongest enemy? towards optimal and efficient evasion attacks in deep rl Y Sun, R Zheng, Y Liang, F Huang ICLR 2022, 2021 | 64 | 2021 |
Efficient adversarial training without attacking: Worst-case-aware robust reinforcement learning Y Liang, Y Sun, R Zheng, F Huang Advances in Neural Information Processing Systems 35, 22547-22561, 2022 | 42 | 2022 |
Certifiably Robust Policy Learning against Adversarial Multi-Agent Communication Y Sun, R Zheng, P Hassanzadeh, Y Liang, S Feizi, S Ganesh, F Huang The Eleventh International Conference on Learning Representations, 2022 | 24* | 2022 |
TACO: Temporal Latent Action-Driven Contrastive Loss for Visual Reinforcement Learning R Zheng, X Wang, Y Sun, S Ma, J Zhao, H Xu, H Daumé III, F Huang Advances in Neural Information Processing Systems 36, 2024, 2023 | 20* | 2023 |
Transfer RL across observation feature spaces via model-based regularization Y Sun, R Zheng, X Wang, A Cohen, F Huang The Eleventh International Conference on Learning Representations, 2022 | 19 | 2022 |
Is Model Ensemble Necessary? Model-based RL via a Single Model with Lipschitz Regularized Value Function R Zheng, X Wang, H Xu, F Huang The Eleventh International Conference on Learning Representations, 2023 | 13 | 2023 |
Is imitation all you need? generalized decision-making with dual-phase training Y Wei, Y Sun, R Zheng, S Vemprala, R Bonatti, S Chen, R Madaan, Z Ba, ... Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 12 | 2023 |
DrM: Mastering Visual Reinforcement Learning through Dormant Ratio Minimization G Xu, R Zheng, Y Liang, X Wang, Z Yuan, T Ji, Y Luo, X Liu, J Yuan, ... The Twelfth International Conference on Learning Representations (ICLR 2024), 2023 | 11 | 2023 |
Equal Long-term Benefit Rate: Adapting Static Fairness Notions to Sequential Decision Making Y Xu, C Deng, Y Sun, R Zheng, X Wang, J Zhao, F Huang arXiv preprint arXiv:2309.03426, 2023 | 4 | 2023 |
Game-theoretic robust reinforcement learning handles temporally-coupled perturbations Y Liang, Y Sun, R Zheng, X Liu, T Sandholm, F Huang, S McAleer The Twelfth International Conference on Learning Representations (ICLR 2024), 2023 | 4 | 2023 |
Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss R Zheng, Y Liang, X Wang, S Ma, H Daumé III, H Xu, J Langford, ... arXiv preprint arXiv:2402.06187, 2024 | 3 | 2024 |
COPlanner: Plan to Roll Out Conservatively but to Explore Optimistically for Model-Based RL X Wang, R Zheng, Y Sun, R Jia, W Wongkamjan, H Xu, F Huang arXiv preprint arXiv:2310.07220, 2023 | 3 | 2023 |
PRISE: Learning Temporal Action Abstractions as a Sequence Compression Problem R Zheng, CA Cheng, H Daumé III, F Huang, A Kolobov arXiv preprint arXiv:2402.10450, 2024 | 1 | 2024 |
ACE: Off-Policy Actor-Critic with Causality-Aware Entropy Regularization T Ji, Y Liang, Y Zeng, Y Luo, G Xu, J Guo, R Zheng, F Huang, F Sun, H Xu arXiv preprint arXiv:2402.14528, 2024 | | 2024 |
PRISE: LLM-Style Sequence Compression for Learning Temporal Action Abstractions in Control R Zheng, CA Cheng, H Daumé III, F Huang, A Kolobov Forty-first International Conference on Machine Learning, 0 | | |
Adapting Static Fairness to Sequential Decision-Making: Bias Mitigation Strategies towards Equal Long-term Benefit Rate Y Xu, C Deng, Y Sun, R Zheng, X Wang, J Zhao, F Huang Forty-first International Conference on Machine Learning, 0 | | |