关注
Wenhao Yang
Wenhao Yang
在 stanford.edu 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
On the Convergence of FedAvg on Non-IID Data
X Li, K Huang, W Yang, S Wang, Z Zhang
arXiv preprint arXiv:1907.02189, 2019
23242019
Communication-efficient local decentralized SGD methods
X Li, W Yang, S Wang, Z Zhang
arXiv preprint arXiv:1910.09126, 2019
112*2019
Toward theoretical understandings of robust Markov decision processes: Sample complexity and asymptotics
W Yang, L Zhang, Z Zhang
The Annals of Statistics 50 (6), 3223-3248, 2022
592022
Federated Reinforcement Learning with Environment Heterogeneity
H Jin, Y Peng, W Yang, S Wang, Z Zhang
International Conference on Artificial Intelligence and Statistics, 18-37, 2022
562022
A regularized approach to sparse optimal policy in reinforcement learning
W Yang, X Li, Z Zhang
Advances in Neural Information Processing Systems 32, 2019
36*2019
A Statistical Analysis of Polyak-Ruppert Averaged Q-Learning
X Li, W Yang, J Liang, Z Zhang, MI Jordan
International Conference on Artificial Intelligence and Statistics, 2207-2261, 2023
18*2023
Robust Markov Decision Processes without Model Estimation
W Yang, H Wang, T Kozuno, SM Jordan, Z Zhang
arXiv preprint arXiv:2302.01248, 2023
10*2023
KL-Entropy-Regularized RL with a Generative Model is Minimax Optimal
T Kozuno, W Yang, N Vieillard, T Kitamura, Y Tang, J Mei, P Ménard, ...
arXiv preprint arXiv:2205.14211, 2022
72022
Semiparametrically efficient off-policy evaluation in linear Markov decision processes
C Xie, W Yang, Z Zhang
International Conference on Machine Learning, 38227-38257, 2023
42023
Finding the Near Optimal Policy via Adaptive Reduced Regularization in MDPs
W Yang, X Li, G Xie, Z Zhang
arXiv preprint arXiv:2011.00213, 2020
32020
Regularization and variance-weighted regression achieves minimax optimality in linear MDPs: theory and practice
T Kitamura, T Kozuno, Y Tang, N Vieillard, M Valko, W Yang, J Mei, ...
International Conference on Machine Learning, 17135-17175, 2023
22023
Semi-infinitely Constrained Markov Decision Processes
L Zhang, Y Peng, W Yang, Z Zhang
Advances in Neural Information Processing Systems 35, 16808-16820, 2022
22022
Statistical Estimation of Confounded Linear MDPs: An Instrumental Variable Approach
M Lu, W Yang, L Zhang, Z Zhang
arXiv preprint arXiv:2209.05186, 2022
22022
Estimation and Inference in Distributional Reinforcement Learning
L Zhang, Y Peng, J Liang, W Yang, Z Zhang
arXiv preprint arXiv:2309.17262, 2023
12023
Distributionally Robust Optimization as a Scalable Framework to Characterize Extreme Value Distributions
PK Kuiper, A Hasan, W Yang, J Blanchet, V Tarokh, Y Ng, H Bidkhori
The 40th Conference on Uncertainty in Artificial Intelligence, 2024
2024
Semi-Infinitely Constrained Markov Decision Processes and Provably Efficient Reinforcement Learning
L Zhang, Y Peng, W Yang, Z Zhang
IEEE Transactions on Pattern Analysis & Machine Intelligence, 1-14, 2023
2023
系统目前无法执行此操作,请稍后再试。
文章 1–16