Title | Authors | Venue | Cited by | Year |
Online Policy Optimization for Robust Markov Decision Process | J Dong, J Li, B Wang, J Zhang | The 40th Conference on Uncertainty in Artificial Intelligence, 2024 | 11* | 2024 |
Combinatorial Bandits under Strategic Manipulations | J Dong, K Li, S Li, B Wang | Proceedings of the Fifteenth ACM International Conference on Web Search and …, 2022 | 10 | 2022 |
Algorithms and theory for supervised gradual domain adaptation | J Dong, S Zhou, B Wang, H Zhao | Transactions on Machine Learning Research (TMLR), 2022 | 5 | 2022 |
Learning to Control under Time-Varying Environment | Y Han, R Solozabal, J Dong, X Zhou, M Takac, B Gu | arXiv preprint arXiv:2206.02507, 2022 | 4 | 2022 |
Taming the Exponential Action Set: Sublinear Regret and Fast Convergence to Nash Equilibrium in Online Congestion Games | J Dong, J Wu, S Wang, B Wang, W Chen | arXiv preprint arXiv:2306.13673, 2023 | 2 | 2023 |
Differentially Private Temporal Difference Learning with Stochastic Nonconvex-Strongly-Concave Optimization | C Zhao, Y Ze, J Dong, B Wang, S Li | International Conference on Web Search and Data Mining (WSDM), 2023 | 2 | 2023 |
Towards Black-Box Membership Inference Attack for Diffusion Models | J Li, J Dong, T He, J Zhang | arXiv preprint arXiv:2405.20771, 2024 | 1 | 2024 |
Provably Efficient Convergence of Primal-Dual Actor-Critic with Nonlinear Function Approximation | J Dong, L Shen, Y Xu, B Wang | International Conference on Autonomous Agents and Multiagent Systems (AAMAS …, 2023 | 1 | 2023 |
Cascading Bandit Under Differential Privacy | K Wang, J Dong, B Wang, S Li | ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 1 | 2022 |
Uncoupled and Convergent Learning in Monotone Games under Bandit Feedback | J Dong, B Wang, Y Yu | arXiv preprint arXiv:2408.08395, 2024 | | 2024 |
Convergence to Nash Equilibrium and No-regret Guarantee in (Markov) Potential Games | J Dong, B Wang, Y Yu | International Conference on Artificial Intelligence and Statistics, 2044-2052, 2024 | | 2024 |
A batch-to-online transformation under random-order model | J Dong, Y Yoshida | Advances in Neural Information Processing Systems 36, 2024 | | 2024 |
Learning in Domain Randomization via Continuous Time Non-Stochastic Control | J Li, J Dong, C Chang, B Wang, J Zhang | arXiv preprint arXiv:2306.01952, 2023 | | 2023 |