Linear last-iterate convergence in constrained saddle-point optimization CY Wei, CW Lee, M Zhang, H Luo arXiv preprint arXiv:2006.09517, 2020 | 110 | 2020 |
Last-iterate convergence of decentralized optimistic gradient descent/ascent in infinite-horizon competitive markov games CY Wei, CW Lee, M Zhang, H Luo Conference on learning theory, 4259-4299, 2021 | 97 | 2021 |
Bias no more: high-probability data-dependent regret bounds for adversarial bandits and mdps CW Lee, H Luo, CY Wei, M Zhang Advances in neural information processing systems 33, 15522-15533, 2020 | 58 | 2020 |
Achieving near instance-optimality and minimax-optimality in stochastic and adversarial linear bandits simultaneously CW Lee, H Luo, CY Wei, M Zhang, X Zhang International Conference on Machine Learning, 6142-6151, 2021 | 46 | 2021 |
No-regret learning in time-varying zero-sum games M Zhang, P Zhao, H Luo, ZH Zhou International Conference on Machine Learning, 26772-26808, 2022 | 39 | 2022 |
A closer look at small-loss bounds for bandits with graph feedback CW Lee, H Luo, M Zhang Conference on Learning Theory, 2516-2564, 2020 | 22 | 2020 |
Autobidders with budget and roi constraints: Efficiency, regret, and pacing dynamics B Lucier, S Pattathil, A Slivkins, M Zhang The Thirty Seventh Annual Conference on Learning Theory, 3642-3643, 2024 | 17 | 2024 |
Linear last-iterate convergence for matrix games and stochastic games CW Lee, H Luo, CY Wei, M Zhang arXiv e-prints, arXiv: 2006.09517, 2020 | 16 | 2020 |
Corralling a larger band of bandits: A case study on switching regret for linear bandits H Luo, M Zhang, P Zhao, ZH Zhou Conference on Learning Theory, 3635-3684, 2022 | 14 | 2022 |
Random mask: Towards robust convolutional neural networks T Luo, T Cai, M Zhang, S Chen, L Wang arXiv preprint arXiv:2007.14249, 2020 | 14 | 2020 |
Improved high-probability regret for adversarial bandits with time-varying feedback graphs H Luo, H Tong, M Zhang, Y Zhang International Conference on Algorithmic Learning Theory, 1074-1100, 2023 | 5 | 2023 |
Adaptive bandit convex optimization with heterogeneous curvature H Luo, M Zhang, P Zhao Conference on Learning Theory, 1576-1612, 2022 | 5 | 2022 |
Advancing query rewriting in e-commerce via shopping intent learning M Zhang, Y Wu, R Rustamov, H Zhu, H Shi, Y Wu, L Tang, Z Zhang, ... | 4 | 2022 |
Practical contextual bandits with feedback graphs M Zhang, Y Zhang, O Vrousgou, H Luo, P Mineiro Advances in Neural Information Processing Systems 36, 2024 | 3 | 2024 |
Online learning in contextual second-price pay-per-click auctions M Zhang, H Luo International Conference on Artificial Intelligence and Statistics, 2395-2403, 2024 | 2 | 2024 |
Efficient contextual bandits with uninformed feedback graphs M Zhang, Y Zhang, H Luo, P Mineiro arXiv preprint arXiv:2402.08127, 2024 | 2 | 2024 |
No-regret learning in two-echelon supply chain with unknown demand distribution M Zhang, S Chen, H Luo, Y Wang International Conference on Artificial Intelligence and Statistics, 3270-3298, 2023 | 2 | 2023 |
Defective Convolutional Networks T Luo, T Cai, M Zhang, S Chen, D He, L Wang arXiv preprint arXiv:1911.08432, 2019 | 2 | 2019 |
Contextual Multinomial Logit Bandits with General Value Functions M Zhang, H Luo arXiv preprint arXiv:2402.08126, 2024 | 1 | 2024 |
No-Regret Learning for Fair Multi-Agent Social Welfare Optimization M Zhang, RDC Vuong, H Luo arXiv preprint arXiv:2405.20678, 2024 | | 2024 |