On the Convergence of FedAvg on Non-IID Data X Li, K Huang, W Yang, S Wang, Z Zhang arXiv preprint arXiv:1907.02189, 2019 | 2324 | 2019 |
Communication-efficient local decentralized SGD methods X Li, W Yang, S Wang, Z Zhang arXiv preprint arXiv:1910.09126, 2019 | 112* | 2019 |
Toward theoretical understandings of robust Markov decision processes: Sample complexity and asymptotics W Yang, L Zhang, Z Zhang The Annals of Statistics 50 (6), 3223-3248, 2022 | 59 | 2022 |
Federated Reinforcement Learning with Environment Heterogeneity H Jin, Y Peng, W Yang, S Wang, Z Zhang International Conference on Artificial Intelligence and Statistics, 18-37, 2022 | 56 | 2022 |
A regularized approach to sparse optimal policy in reinforcement learning W Yang, X Li, Z Zhang Advances in Neural Information Processing Systems 32, 2019 | 36* | 2019 |
A Statistical Analysis of Polyak-Ruppert Averaged Q-Learning X Li, W Yang, J Liang, Z Zhang, MI Jordan International Conference on Artificial Intelligence and Statistics, 2207-2261, 2023 | 18* | 2023 |
Robust Markov Decision Processes without Model Estimation W Yang, H Wang, T Kozuno, SM Jordan, Z Zhang arXiv preprint arXiv:2302.01248, 2023 | 10* | 2023 |
KL-Entropy-Regularized RL with a Generative Model is Minimax Optimal T Kozuno, W Yang, N Vieillard, T Kitamura, Y Tang, J Mei, P Ménard, ... arXiv preprint arXiv:2205.14211, 2022 | 7 | 2022 |
Semiparametrically efficient off-policy evaluation in linear Markov decision processes C Xie, W Yang, Z Zhang International Conference on Machine Learning, 38227-38257, 2023 | 4 | 2023 |
Finding the Near Optimal Policy via Adaptive Reduced Regularization in MDPs W Yang, X Li, G Xie, Z Zhang arXiv preprint arXiv:2011.00213, 2020 | 3 | 2020 |
Regularization and variance-weighted regression achieves minimax optimality in linear MDPs: theory and practice T Kitamura, T Kozuno, Y Tang, N Vieillard, M Valko, W Yang, J Mei, ... International Conference on Machine Learning, 17135-17175, 2023 | 2 | 2023 |
Semi-infinitely Constrained Markov Decision Processes L Zhang, Y Peng, W Yang, Z Zhang Advances in Neural Information Processing Systems 35, 16808-16820, 2022 | 2 | 2022 |
Statistical Estimation of Confounded Linear MDPs: An Instrumental Variable Approach M Lu, W Yang, L Zhang, Z Zhang arXiv preprint arXiv:2209.05186, 2022 | 2 | 2022 |
Estimation and Inference in Distributional Reinforcement Learning L Zhang, Y Peng, J Liang, W Yang, Z Zhang arXiv preprint arXiv:2309.17262, 2023 | 1 | 2023 |
Distributionally Robust Optimization as a Scalable Framework to Characterize Extreme Value Distributions PK Kuiper, A Hasan, W Yang, J Blanchet, V Tarokh, Y Ng, H Bidkhori The 40th Conference on Uncertainty in Artificial Intelligence, 2024 | | 2024 |
Semi-Infinitely Constrained Markov Decision Processes and Provably Efficient Reinforcement Learning L Zhang, Y Peng, W Yang, Z Zhang IEEE Transactions on Pattern Analysis & Machine Intelligence, 1-14, 2023 | | 2023 |